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PREFACE 


The first glimpse of the three-dimensional structure of a mem- 
brane protein was revealed in 1975, when Henderson and Unwin pub- 
lished their landmark analysis of bacteriorhodopsin at 7 Ä resolution 
(Henderson and Unwin, 1975). This study established the presence of 
seven rodlike features, identified as a-helices, in the membrane span- 
ning region of bacteriorhodopsin, consistent with expectations based 
on the behavior of polypeptides in nonaqueous solvents (see Singer, 
1962). To place this development in context with the overall status of 
protein structure determinations at that time, X-ray crystal structures 
were available for ~80 water-soluble proteins (Matthews, 1976) when 
Henderson and Unwin’s work appeared. During the intervening quar- 
ter century since this initial peek at bacteriorhodopsin, the structures of 
~60 membrane proteins have been determined (see http://www.mpibp- 
frankfurt.mpg.de/michel/public/memprotstruct.html). Correcting for 
the ~25-year offset, the rate of membrane protein structure determina- 
tions (Figure 1) over the 15-year period following the first high resolution 
structure of the photosynthetic reaction center (Deisenhofer et al., 1985) 
closely parallels the progress observed for water-soluble proteins after the 
myoglobin structure (Kendrew et al., 1960). As we approach +20 years 
since the solution of the reaction center structure, it is not a very bold 
extrapolation to predict that the number of solved membrane protein 
structures is poised to explode, much as the number of water-soluble 
protein structures did in the 1980s, ~20 years after the myoglobin struc- 
ture. For example, since the chapters for this review were commissioned, 
new structures have appeared for the chloride channel (Dutzler et al., 
2002), formate dehydrogenase (Jormakka et al., 2002), the vitamin Bu 
transporter (Locher et al., 2002), the multidrug efflux transporter AcrB 


xi 
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O water-soluble proteins 


X membrane proteins 


number of structures/year 





0 5 10 15 
time after first structure (years) 


Fic. 1. The number of water soluble (o) and membrane proteins (x) solved per 
year, relative to the date of the first structure determination in each class, myo- 
globin (Kendrew et al., 1960), and the photosynthetic reaction center (Deisenhofer 
et al., 1985), respectively. The data for water-soluble proteins are from Matthews 
(1976), while those for membrane proteins are from the website http://www.mpibp- 
frankfurt.mpg.de/michel/public/memprostruct.html. Adjusting for the ~25 year dif- 
ference, it is evident that progress in the structure determination of membrane proteins 
mirrors that experienced for water-soluble proteins. For perspective, there were 18,838 
total available structures in the Protein Data Bank as of October 3, 2002, with 3298 struc- 
tures deposited in 2001 (see http://www.rcsb.org/pdb/holdings_table.html). 


(Murakami et al., 2002), and the MscS mechanosensitive channel (Bass 
et al., 2002), among others. 

Although membrane protein structure will not likely become a mature 
field for another decade or two, one consequence of this recent surge 
in activity is that systematic coverage of all known structures is no longer 
possible. The eleven chapters in this volume review recent developments 
for selected membrane proteins from a variety of perspectives that em- 
phasize the blending of structural and functional approaches, with the 
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objective of establishing a comprehensive mechanistic framework for a 
particular system. 

The first three chapters address issues of general relevance to all mem- 
brane proteins, namely, their biosynthesis and the basic structural prin- 
ciples underlying their construction. Gunnar von Heijne describes the 
cellular mechanisms responsible for “functionalization” of the lipid bi- 
layer with proteins and the possible constraints imposed on membrane 
protein structure by this process. A detailed understanding of these 
mechanisms is not only of biological interest, but also of practical signifi- 
cance since the overexpression of eukaryotic membrane proteins looms 
as one of the greatest challenges to their structural characterization. 
James Bowie and co-workers address the energetic principles, including 
lipid-protein interactions, that drive the formation and stability of helix- 
bundle membrane proteins. As described in this chapter, the thermody- 
namic stability of membrane proteins and the energetic consequences 
of protein folding in a nonaqueous environment represent some of the 
challenging and outstanding problems in this field. While many mem- 
brane proteins consist of a-helices, those found in the outer membranes 
of bacteria are based on f-barrels of remarkable diversity. Georg Schulz 
provides an analysis of their construction principles, highlighting the 
implications of these observations for the engineering of channels with 
novel properties. 

Due to their high natural abundance, it is no coincidence that sig- 
nificant progress has been made in the characterization of membrane 
protein complexes that mediate energy transduction processes such as 
photosynthesis and respiration. P. Leslie Dutton and coauthors develop a 
comprehensive framework for analyzing energy and electron transfers in 
photosystems centered around the spatial organization of cofactors into 
chains. These considerations again have significant implications for the 
design principles of both biological and engineered systems. The “other” 
photosynthetic system, bacteriorhodopsin, has not only played a central 
role in the structural analysis of membrane proteins, but also in deci- 
phering the basic features of energy transduction processes. Hartmut 
Luecke and Janos Lanyi describe the exciting recent progress and the 
challenges in establishing the mechanism of proton pumping by bacte- 
riorhodopsin in molecular detail. The next two chapters describe the 
structure and mechanism of quinone binding respiratory complexes. 
Quinones play a central role in membrane bioenergetics, serving as dif- 
fusible, lipid-soluble carriers of electrons and protons that link many of 
the photosynthetic and respiratory systems in electron transfer chains. 
C. R. D. Lancaster reviews the Wolinella fumarate quinol reductase that, 
with the structure of the Escherichia colienzyme (Iverson et al., 1999), has 
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structurally defined the family of succinate:quinone oxidoreductases. 
This family is of central importance to the energy metabolism of many 
organisms, as various members can function in aerobic respiration (as 
complex II), the Krebs cycle, and as terminal acceptors during anaerobic 
respiration. So Iwata and co-workers describe the structure and function 
of two respiratory complexes, cytochrome bc; (or complex III) and cyto- 
chrome boz (or ubiquinol oxidase) that can couple electron transfer to 
proton translocation at the heart of energy conservation during respi- 
ration. These structures, together with the structures of cytochrome c 
oxidase or complex IV (Iwata et al., 1995; Tsukihara et al., 1995), ATP 
synthase (Abrahams et al., 1994; Stock et al., 1999), and photosystem I 
(Jordan et al., 2001), provide a nearly complete structural characteri- 
zation of the major photosynthetic and aerobic respiratory complexes, 
missing only the structure of complex I (NADH dehydrogenase) and 
the extension of the photosystem II structure (Zouni et al., 2001) to 
high resolution. 

Channels and receptors mediate the flow of matter and information 
across the membrane bilayer that are fundamental to many biological 
processes. The last four chapters of this volume address systems that high- 
light the rich functional diversity of membrane proteins in these capaci- 
ties. Microorganisms must be able to adapt to rapid changes in their en- 
vironment, such as sudden drops in external osmolarity that can lead to 
swelling and lysis. To protect against this, stretch-activated (mechanosen- 
sitive) channels of large conductance, first identified by Kung and 
co-workers (Martinac et al., 1987), are present in the cell membrane that 
appear to serve as safety valves to reduce the possibility of cell rupture 
under these conditions. Advances in the structure and mechanism of 
these mechanosensitive channels are reviewed in the chapter by my 
group. In eukaryotic organisms, voltage-gated channels mediate signal- 
ing processes that are of tremendous physiological and pharmacological 
significance. Francisco Bezanilla and Eduardo Perozo present an analysis 
of structural and dynamic properties of the voltage sensor derived from 
exquisite biophysical and biochemical studies. An important develop- 
ment will be the high resolution structure of a voltage-gated channel to 
define the structural organization of the voltage sensor, to complement 
the studies of MacKinnon in establishing the structural basis of ion selec- 
tivity (Doyle et al., 1998). G-Protein-coupled receptors are key elements 
of signal transduction pathways in eukaryotes and represent important 
pharmacological targets. Rhodopsin, that initiates the visual response, 
is the paradigm of this receptor family. Wayne Hubbell and coauthors 
describe the properties of rhodopsin as established from crystallo- 
graphicand biophysical studies, particularly the site-directed spin labeling 


PREFACE XV 


approach developed by his group. Of particular interest is the analysis of 
the light-mediated changes in the structure and dynamics of rhodopsin. 
In the final chapter, Robert Stroud and co-workers detail the structure 
and selectivity mechanism of the glycerol facilitator GlpF that is a mem- 
ber of the aquaporin family of channels. By mediating the flow of water 
and a few other small solutes such as glycerol across the relatively imper- 
meable cell membrane, aquaporins are critical to the maintenance of 
the appropriate osmotic pressure balance in prokaryotic and eukaryotic 
cells. 

At the start of the twenty-first century, the pace of membrane pro- 
tein structure determinations is clearly accelerating (Figure 1). With 
the exceptions of rhodopsin (Palczewski et al., 2000) and the calcium 
ATPase (Toyoshima et al., 2000), however, eukaryotic channels, trans- 
porters, and receptors are conspicuously absent from the list of known 
membrane protein structures. These two exceptions, as proteins of natu- 
rally high abundance, highlight the current reality that no structure has 
been determined for an overexpressed eukaryotic membrane protein. 
This situation reflects the present difficulties in the reliable overexpres- 
sion of membrane proteins, particularly those of eukaryotic organisms. 
Just as the development ~20 years ago of overexpression systems for 
water-soluble proteins revolutionized the structure determinations of 
this class of proteins, advances in membrane protein expression will be 
essential to successful realization of the goal of routine structural analysis 
of membrane proteins. 

In this era of proliferating reviews, investigators have many opportuni- 
ties to satisfy such urges, particularly in a field such as membrane protein 
structure and function. Consequently, I would particularly like to thank 
the authors of this volume for the time commitment and effort required 
to prepare their contributions. 

Douglas C. Rees 
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I. INTRODUCTION 


Although much current interest is focused on the structure-function 
relationships of membrane proteins, agood understanding of the cellu- 
lar processes responsible for the insertion of proteins into lipid bilayer 
membranes is a necessary prerequisite for assessing how much the acces- 
sibly “structure space” is constrained not by the bilayer per se but rather 
by the idiosyncrasies of the machineries that have evolved to guide the 
insertion process. In most cases, membrane proteins use the same tar- 
geting and insertion mechanisms used to sort soluble proteins between 
different cellular compartments but add an extra level of complexity: 
How are membrane proteins recognized as proteins that should be only 
partly translocated, leaving the transmembrane segments ofthe polypep- 
tide chain spanning the bilayer? Is the basis for this recognition the same 
in different translocation machineries, and, if not, how are such differ- 
ences translated into different constraints on the allowable structures in 
any given cellular membrane? 

In this article, I will review the current knowledge of how membrane 
proteins are handled by different targeting-translocation machineries 
from the perspective of possible structural constraints imposed by these 
machineries. As will become clear, this point of view has not been much 
elaborated up to now, and there are not many clear examples of struc- 
tural constraints that go beyond those imposed by the lipid bilayer itself. 
Nevertheless, I will argue that such constraints do exist and that they can- 
not be ignored if we wish to fully understand the principles underlying 
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membrane protein structure. For in-depth reviews of protein targeting 
mechanisms in general, a good up-to-date source is Dalbey and von 
Heijne (2002). 


II. OVERVIEW OF MEMBRANE PROTEIN ASSEMBLY PATHWAYS 
IN PROKARYOTIC AND EUKARYOTIC CELLS 


Over the past 30 years, the study of intracellular protein sorting has 
grown to a large and diversified field. A host of different sorting path- 
ways have been found in both prokaryotic and eukaryotic cells, and most 
of these can handle both soluble and membrane-bound proteins. 

The most well understood pathway is the one that delivers secretory 
and membrane proteins to the endoplasmic reticulum (ER) membrane 
in eukaryotic cells and to the inner membrane in bacteria. In both kinds 
of cells, the pivotal role is played by the so-called Sec61 (in eukaryotes) or 
SecYEG (in prokaryotes) translocon, a multisubunit translocation chan- 
nel that provides a conduit for soluble proteins to cross the membrane. 
The same translocon also serves to integrate membrane proteins into 
the lipid bilayer. 

Bacteria harbor additional inner membrane translocation machiner- 
ies, such as the “twin-arginine translocation” (Tat) system, the YidC sys- 
tem, and the so-called type II translocation systems that are dedicated to 
one or a small number of substrates and ensure their delivery into the 
extracellular medium. 

The Tat and YidC systems are also found in certain organelles in eu- 
karyotic cells. The former is present in the thylakoid membrane in plant 
cells, and the latter is found both in the thylakoid membrane (where it 
is called the Albino3 system) and in the inner mitochondrial membrane 
(where it is called the Oxalp system). Both chloroplasts and mitochon- 
dria appear to have unique systems for importing proteins across their 
outer and inner membranes, and these systems also handle membrane 
proteins. 

Finally, peroxisomes contain another unique machinery for protein 
import and membrane protein assembly. 

From this list of targeting-translocation machineries it is clear that 
membrane proteins from different subcellular compartments are not 
handled in the same way and thus may be expected to be under differ- 
ent constraints as concerns the requirements for membrane insertion. 
However, very few comparative data are available and the only system 
for which we have a detailed understanding of the insertion process and 
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how it constrains the final structure of the protein is the eukaryotic ER. 
The main focus of the chapter will thus be on ER targeting and inser- 
tion of membrane proteins, though the other systems will also be briefly 
reviewed. 


II. MEMBRANE PROTEIN ASSEMBLY IN THE ER 


For both secretory and membrane proteins, the initial event associated 
with ER targeting is the binding of the N-terminal signal peptide—or the 
most N-terminal transmembrane segment—to the signal recognition 
particle (SRP) (Johnson and van Waes, 1999). SRP binding is thought 
to slow down the rate of elongation, giving the ribosome-nascent chain- 
SRP complex more time to find an empty translocation site on the ER 
membrane. These sites are composed of the SRP receptor complex, 
the basic Sec61 translocon (an oligomer of the Sec6la, ß, and y sub- 
units), the TRAM protein, the oligosaccharyl transferase (OST) com- 
plex, and the signal peptidase (SPase) complex. On binding to the SRP 
receptor, SRP dissociates from the ribosome, allowing it to dock to the 
Sec61 translocon and to insert the N-terminal part of the nascent chain 
(including the signal peptide) into the translocation channel. As the 
elongation rate picks up again, the nascent chain is extruded cotrans- 
lationally through the long conduit thus assembled from the nascent 
chain tunnel in the ribosomal large subunit and the Sec6l transmem- 
brane channel (Beckmann et al., 2001). The signal peptide is cleaved by 
SPase at an early stage of translocation, and Asn-linked oligosaccharides 
are attached to the growing nascent chain as it passes in the vicinity of 
the OST. 

How is this process modified for membrane proteins? The major dif- 
ference is that the ribosome-Sec61 translocon complex somehow can 
recognize sufficiently hydrophobic segments in the nascent chain, either 
when they enter the Sec61 channel, or possibly already when they move 
through the ribosomal large subunit tunnel. The recognition of a hy- 
drophobic transmembrane segment by the ribosome-translocon com- 
plex leads alternately to the channel closing at its lumenal end and open- 
ing at the cytoplasmic ribosome-Sec61 junction or the opposite—closing 
of the cytoplasmic junction, opening at the lumenal end (Hamman ei al. 
1998; Liao et al., 1997). In the simplest model based on these observa- 
tions (Fig. 1), the signal peptide (or the most N-terminal transmembrane 
segment) thus opens the lumenal translocon “gate,” allowing free pas- 
sage of the immediate downstream part of the nascent chain across the 
membrane, the next transmembrane segment closes the lumenal gate 
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ER lumen 





Fic. 1. Model for membrane protein insertion into the ER membrane. In stage I, 
an N-terminal signal peptide (not shown) has already initiated translocation across the 
membrane. 


and opens up an exit route toward the cytoplasmic side for the following 
part of the nascent chain, the succeeding transmembrane segment again 
opens the lumenal gate, etc., effectively “stitching” the transmembrane 
segments into the membrane one by one. 

Obviously, the transmembrane segments must at some point be ex- 
pelled laterally from the translocation channel into the surrounding 
lipid bilayer. The available information on this crucial step is limited, 
but it has been shown by rather sophisticated cross-linking techniques 
that a transmembrane segment appears to follow an ordered exit path- 
way where it is first found in the vicinity of the Sec6l« protein, then 
TRAM, and finally lipids (Do et al., 1996). Some transmembrane seg- 
ments move to a lipid-exposed location almost immediately on entering 
the Sech) channel, whereas others may remain within the channel for 
a longer time, possibly until a downstream transmembrane segment ap- 
pears whereupon the two may exit en bloc (Heinrich et al., 2000). It has 
even been claimed that multiple transmembrane segments or even the 
whole protein may assemble within the translocation channel before 
moving away from the translocon (Borel and Simon, 1996a,b). A better 
definition of the exit step in the assembly process will be crucial for our 
understanding of membrane protein folding in vivo. 

If our knowledge of the molecular mechanisms responsible for the 
recognition and membrane integration of transmembrane segments is 
thus quite limited, the situation is significantly better when we ask about 
the substrate, i.e., the characteristics of the nascent chain that have an im- 
pact on the assembly process. Starting with the hydrophobic transmem- 
brane segment itself, many studies have shown that there is a minimum 
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hydrophobicity threshold required for membrane integration (Kuroiwa 
et al., 1991; Sääf et al., 1998); to a first approximation, the precise amino 
acid sequence is thus unimportant (but see below). As an example, a 
stretch of 7-8 consecutive Leu residues is sufficient for the formation of 
a transmembrane topology. The threshold hydrophobicity (measured 
over a 20-residue window) appears to be close to that of poly(Ala) 
(Kuroiwa et al., 1991), as has also been suggested by calculations based on 
the insertion of synthetic peptides into lipid bilayers (Jayasinghe et al., 
2001). Nevertheless, it is not possible to cleanly discriminate all known 
transmembrane segments from nontransmembrane segments based on 
threshold hydrophobicity alone, suggesting that there is more to the 
story than simple greasiness. 

Transmembrane segments are connected to one another by loops 
(anything from short turns to large domains) of the polypeptide chain, 
and these have been found to have a strong influence on the assembly 
and final topology of membrane proteins. In multispanning membrane 
proteins, loops tend to be short [typically ~10 residues (Tusnady and 
Simon, 1998)]. For short loops, there is a very strong asymmetry 
in the content of positively charged residues (Lys and Arg), with 
cytoplasmic (“inside”) loops being 2- to 4-fold richer in such residues 
than extracytoplasmic (“outside”) loops (Sipos and von Heijne, 1993; 
von Heijne, 1986; Wallin and von Heijne, 1995). This so-called positive 
inside rule holds for proteins from nearly all organisms and intracellular 
membranes (Gavel et al., 1991; Gavel and von Heijne, 1992; van de 
Vossenberg et al., 1998; Wallin and von Heijne, 1998), and the only clear 
exception found so far is the class of mitochondrial inner membrane 
proteins that are imported by using a “stop-transfer” mechanism (see 
below). Mitochondrial inner membrane proteins that are synthesized 
within the organelle or those that are imported using the “conservative 
sorting” mechanism do seem to follow the positive inside rule, however. 

Numerous examples now exist where positively charged residues 
have been shown to influence the topology of both single- and 
multispanning membrane proteins. It has even been shown that a 
strongly hydrophobic segment can be prevented from inserting across 
the membrane if it is flanked by positively charged residues on both 
ends (so-called “frustrated” topologies (Gafvelin et al., 1997; Gafvelin 
and von Heijne, 1994)), and, vice versa, that a polar segment of the 
polypeptide chain can be forced into a transmembrane disposition by 
flanking hydrophobic segments that both have the same orientational 
preference (Ota et al., 1998). Recently, examples of “topology evolu- 
tion” have been found, where two homologous proteins with the same 
number of transmembrane segments adopt opposite orientations in 
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the membrane as a result of different distributions of positively charged 
residues in their loops (Sääf et al., 1999, 2001). 

Despite much work the molecular basis for the positive inside rule 
is still not clear, although both the membrane potential and lipid 
composition have been shown to affect the degree to which positively 
charged residues affect topology (Andersson and von Heijne, 1994; 
van Klompenburg et al., 1997). Given the ubiquitous nature of the pos- 
itive inside rule—it applies across all domains of life, in different or- 
ganelles, and in microorganisms from very different ecological niches— 
it seems unlikely that it can be fully explained by the thermodynamics 
of protein-lipid interactions. Rather, it probably finds its basis in a ki- 
netic difference between the rate of translocation of positively versus 
negatively charged polypeptide segments across biological membranes, 
perhaps modulated by the intrinsic properties of different kinds of 
translocons. 

Positively charged residues were the first topological determinants to 
be discovered, but they are not the only ones (Goder and Spiess, 2001). 
Thus, at least for single-spanning membrane proteins with an N-terminal 
signal-anchor sequence, the length of the hydrophobic stretch also helps 
determine its orientation. Long hydrophobic segments favor an Nout-Cin 
orientation, and short segments have the opposite preference (Wahlberg 
and Spiess, 1997). Finally, the Nou-Cin orientation for signal-anchor 
sequences is prevented if the N-terminal tail preceding the hydrophobic 
segment folds too fast (Denzer et al., 1995). 

As noted above, more than one transmembrane segment may be 
present within the translocon at any one time, suggesting the possibil- 
ity that transmembrane segments may affect one anothers topological 
preferences. Indeed, many studies have demonstrated that downstream 
transmembrane segments can influence the insertion of upstream seg- 
ments (Monné et al., 1999a; Nilsson et al., 2000). In general, it seems 
reasonable that the possibilities for such interhelix effects should be 
maximal when the two transmembrane segments are located very close 
to each other in the primary sequence. Indeed, such closely spaced trans- 
membrane segments may in fact act as one “insertion unit.” The term 
“helical hairpin” (Engelman and Steitz, 1981) is often used to describe 
this situation. 

We have studied the sequence determinants for helical hairpin for- 
mation during the insertion of a model membrane protein into the 
ER membrane. To simplify the problem, we engineered a 40-residue 
long poly(Leu) stretch into a membrane protein that inserts readily 
into ER-derived microsomes when expressed in vitro (Fig. 2A). Asn- 
X-Thr acceptor sites for N-linked glycosylation were used as topo- 
logical markers, as they can only be modified when located in the 
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Fic. 2. Model proteins used to study helical hairpin formation. (A) The H2 trans- 
membrane segment in the leader peptidase protein was replaced by a 40-residue long 
hydrophobic segment composed of 39 leucines and one valine. An Asn-Ser-Thr glycosyla- 
tion acceptor site (Y) was used as a topological marker (this site can only be glycosylated 
if present in the ER lumen). A single Leu > Pro mutation in the middle of the poly(Leu) 
segment is enough to convert it into a helical hairpin. (B) Same as in A, but with the 
poly(Leu) segment inserted into the normally lumenal P2 domain. In this case, two glyco- 
sylation acceptor sites were used to determine the topology of constructs where different 
amino acids were inserted in the middle of the poly(Leu) segment (arrow). Filled Y’s 
indicate glycosylated acceptor sites, unfilled Ve indicate nonglycosylated acceptor sites. 


lumen of the microsomes. The poly(Leu) stretch was found to insert 
as a single transmembrane segment, even though it is twice as long 
as “normal” transmembrane helices. However, single Leu— Pro mu- 
tations efficiently converted the single transmembrane segment to a 
helical hairpin when more than 15 residues distant from either end 
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of the poly(Leu) stretch (Nilsson and von Heijne, 1998). Similar ef- 
fects were seen when charged and polar residues were substituted for 
Leu, but not when apolar and hydrophobic residues were introduced 
(Monné et al., 1999b; Monné et al., 1999c). We also found that the 
poly(Leu) stretch has to be longer than 30 residues to allow Pro- 
induced helical hairpin formation (Monné et al., 1999c). Finally, blocks 
of charged residues placed immediately downstream of the poly (Leu) 
stretch also promote the formation of a helical hairpin topology 
(Hermansson et al., 2001). 

Interestingly, it appears that it is “easier” to induce the formation of a 
helical hairpin with the tight turn on the lumenal side of the ER mem- 
brane than one with the opposite orientation (cytoplasmic turn); i.e., 
whereas a single Pro is enough to convert the 40-residues long poly (Leu) 
stretch to a helical hairpin with a lumenal turn, three consecutive pro- 
lines are needed for a helical hairpin with a cytoplasmic turn to form 
(Sääf et al., 2000) (Fig. 2B). If one only considers simple protein-lipid 
interactions there is no obvious thermodynamic reason why this should 
be so; instead, we favor the view that this reflects a constraint on helical 
hairpin structure imposed by the Sec machinery. 

Finally, there is one particular class of membrane proteins for which 
special rules seem to apply: the so-called tail-anchored proteins. Tail- 
anchored proteins have only a single hydrophobic segment close to their 
C-terminus that inserts into the ER membrane with the C-terminus pen- 
etrating to the lumenal side. One consequence of this is that there is 
no N-terminal targeting information, and the cotranslational SRP path- 
way hence cannot be brought into play. Instead, tail-anchored proteins 
are targeted posttranslationally to the ER and do not appear to use the 
normal Sec61 translocon. The mechanistic details of the targeting and 
membrane insertion processes are largely unknown; however, in con- 
trast to the SRP-Sec61 pathway, tail-anchored proteins depend on ATP 
rather than GTP for membrane insertion (Kutay et al., 1995). Interest- 
ingly, the “minimum hydrophobicity” of the transmembrane segment is 
similar to what is required for membrane insertion via the SRP-Sec61 
pathway (Whitley et al., 1996). 

In conclusion, the available studies on membrane protein assembly 
into the ER suggest that the ribosome-translocon channel is where hy- 
drophobic segments are first recognized, eventually leading to their ex- 
pulsion into the lipid bilayer as transmembrane helices. Orientational 
preferences are largely encoded within the regions immediately flank- 
ing the hydrophobic stretches, even if the length of the hydrophobic 
stretch can play a role as well. Pairs of closely spaced transmembrane 
segments—“helical hairpins”—may behave as “insertion units” that are 
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recognized en blocby the translocation machinery. Finally, although most 
of the general structural aspects of helix bundle membrane proteins 
appear reasonably consistent with the basic principles of protein—lipid 
thermodynamics, there are some details where constraints imposed by 
the translocation machinery seem to be responsible: the positive-inside 
rule is one such instance (and, inter alia, “frustrated” topologies where 
hydrophobic segments are prevented from crossing the membrane as 
well as topologies where polar segments are forced into a transmem- 
brane disposition), and the different requirements for the formation of 
helical hairpins with lumenal versus cytoplasmic loops are another. 


IV. MEMBRANE PROTEIN ASSEMBLY IN Escherichia coli 


As far as is known, the basic rules for membrane protein insertion into 
the inner membrane of E. coli are quite similar to those discussed in the 
preceding section. Thus, most inner membrane proteins are thought to 
be targeted to the membrane by the bacterial SRP homologue, and in- 
serted via the SecYEG translocon (de Gier and Luirink, 2001). Whether 
membrane insertion is co- or posttranslational is not really clear, how- 
ever. For inner membrane proteins with large periplasmic domains, the 
SecA ATPase seems to be required in order to translocate these large 
parts but possibly not for shorter loops (Andersson and von Heijne, 
1993; Gafvelin and von Heijne, 1994; Sääf et al., 1995). 

A new inner membrane protein, YidC, has been shown to be crit- 
ical for the assembly process (Samuelson et al., 2000). YidC is found 
associated with the SecYEG translocon, but may also exist as a separate 
complex (Scotti etal., 2000). Homologues to YidC are present in yeast mi- 
tochondria (the Oxalp protein) and thylakoids (the Albino3 protein), 
where they are also involved in membrane protein assembly (Hell et al., 
1998; Luirink et al., 2001; Sundberg et al., 1997). Interestingly, mitochon- 
dria from the yeast Saccharomyces cerevisiae lack the SecYEG components, 
strongly suggesting that YidC can function in the absence of a Sec ma- 
chinery (Glick and von Heijne, 1996). Indeed, even in E. coli there are 
proteins such as the M13 procoat protein that appear not to use SecYEG 
but only depend on YidC (Samuelson et al., 2001). 

Concerning how hydrophobic transmembrane segments are recog- 
nized by the SecYEG-YidC translocon, not much is known. A cross- 
linking study using a protein with a single transmembrane segment has 
revealed an apparently ordered process where the hydrophobic segment 
is first found in the vicinity of SecY, then YidG, and finally lipids (Urbanus 
et al., 2001; van der Laan et al., 2001). This is very similar to what has 
been found for the Sec61-TRAM complex in the ER (see above) and 
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suggests that YidC and TRAM, although not related by sequence sim- 
ilarity, may nevertheless perform similar functions. This impression is 
reinforced by recent work on a multispanning inner membrane protein 
showing that at least three consecutive transmembrane segments can 
be cross-linked simultaneously to YidC (Beck et al., 2001). At the same 
time, these segments form lipid cross-links, suggesting that YidC can in- 
teract with multiple transmembrane segments in a nascent protein and 
that it also allows the transmembrane segments to contact lipids. Thus, 
either the transmembrane segment can partition dynamically between 
the YidC complex and surrounding lipids, or lipids can penetrate deep 
into the YidC complex. 

In addition to the SecYEG-YidC translocon, E coli also has a machi- 
nery that can translocate folded polypeptides to the periplasmic space. 
Signal peptides that target proteins to this machinery contain a diag- 
nostic Arg-Arg motif (Berks et al., 2000); hence the name twin arginine 
translocation (Tat) system. The Tat system is also found in the thylakoid 
membrane of chloroplasts. Most Tat substrates identified so far are sol- 
uble proteins, but at least one inner membrane protein has been re- 
ported to use this pathway (Molik et al., 2001). It will be very interesting 
to see whether the requirements in terms of hydrophobicity and flank- 
ing charged residues are the same for Tat-targeted and SecYEG-targeted 
membrane proteins. 

Finally, a few words on the assembly of outer membrane f-barrel pro- 
teins. These proteins are made with an N-terminal signal peptide that 
targets them for translocation across the inner membrane via the SecYEG 
translocon. Aided by periplasmic chaperones such as Skp (de Cock et al., 
1999; Harms et al., 2001; Schafer et al., 1999), the protein then folds 
and inserts into the outer membrane, possibly in a concerted folding- 
insertion reaction. Folding and insertion can be reconstituted in vitro 
(Tamm et al., 2001), although the kinetics seen so far are orders of mag- 
nitude slower than in vivo. It is completely unclear how the orientation of 
the ß-barrel proteins across the outer membrane is controlled; the regu- 
larities noted so far are (i) short loops face the periplasm, long loops the 
extracellular side, and (ii) the N- and C-termini tend to be located on 
the periplasmic side (Schulz, 2000), but whether these characteristics are 
just correlations or actually help determine the orientation is not known. 


V. MEMBRANE PROTEIN ASSEMBLY IN MITOCHONDRIA 


Mitochondria import most of their outer and inner membrane pro- 
teins from the cytosol, but also encode a small number of inner 
membrane proteins in their own genome. As noted above, for proteins 
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that are translated in the mitochondrial matrix, insertion into the inner 
membrane has been found to depend on the YidC homologue Oxalp 
(Hell et al., 1997, 1998, 2001) and their topology seems to obey the 
positive inside rule (Rojo et al., 1999). 

Three different assembly pathways have been suggested for the 
imported inner membrane proteins. All three utilize the so-called 
TOM complex for translocation through the outer membrane. Two 
of the pathways further make use of the same TIM23 complex in the 
inner membrane that also handles soluble matrix proteins; the third 
involves both a distinct set of chaperones in the intermembrane space 
as well as a distinct inner membrane insertion machinery, the TIM22 
complex. 

The TIM22 pathway is used by, for example, proteins that belong to 
the mitochondrial carrier protein family (Koehler, 2000; Pfanner and 
Geissler, 2001). After being chaperoned through the intermembrane 
space by the “Tiny TIM complexes” TIM9/10 and TIM8/13, proteins on 
this pathway are delivered to the inner membrane TIM22 complex and 
are directly inserted into the bilayer. It appears that positively charged 
loops are driven across the inner membrane by the membrane potential, 
but not much else is known about the insertion mechanism. 

The two TOM-TIM23 pathways—the “conservative sorting” and “stop- 
transfer” pathways—differ in that the former posits a process in which 
the inner membrane protein is first fully imported into the matrix space 
and then inserted into the inner membrane from the matrix side using 
the Oxalp complex (Hell et al., 2001; Stuart and Neupert, 1996). This 
process is “conservative” in an evolutionary sense, since the prokaryotic 
ancestor of present-day mitochondria presumably inserted all their in- 
ner membrane proteins from the cytoplasmic (i.e., matrix) side of the 
membrane. 

In contrast, in the “stop-transfer” model, hydrophobic segments in the 
imported inner membrane protein get stuck in the TIM23 channel and 
leave the translocon laterally, in much the same way as is envisaged for 
the Sec61 and SecYEG translocons (Tokatlidis et al., 1996). Thus, there is 
no matrix-localized intermediate in this case. Possibly, this may explain 
an early observation (Gavel and von Heijne, 1992) that some imported 
inner membrane proteins follow the positive inside rule (presumably 
those that use the “conservative sorting” or the TIM22 pathways) whereas 
others do not (presumably those that are inserted by the stop-transfer 
mechanism). 

Whether the different mitochondrial inner membrane assembly path- 
ways impose different constraints on the final structure of the inner 
membrane proteins is not known. 
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VI. MEMBRANE PROTEIN ASSEMBLY IN CHLOROPLASTS 


Chloroplasts in higher plants have three membranes: the outer and 
inner envelope membranes and the thylakoid membrane. Very little is 
known about membrane protein assembly into the two envelope mem- 
branes (Soll and Tien, 1998). The thylakoid has been better studied and 
in fact appears to use mechanisms very similar to those found in E. coli 
for membrane protein insertion (Dalbey and Robinson, 1999). Thus, 
SRP, SecA, SecYEG, YidC, and Tat homologues are all present in the 
thylakoid membrane or in the stroma (the Tat system was first identified 
in thylakoids, in fact). In contrast to E. coli, however, there are thylakoid 
proteins that appear to insert “spontaneously” into the membrane, in- 
sofar as no requirement for any of the known translocation machineries 
has been detected (Mant et al., 2001). 


VII. MEMBRANE PROTEIN ASSEMBLY IN PEROXISOMES 


So far, studies of protein import into peroxisomes have focused largely 
on soluble matrix proteins (Fujiki, 2000; Sacksteder and Gould, 2000). 
The few membrane proteins studied to date appear to have targeting 
signals with no obvious consensus sequence and that may be located in 
either matrix or cytoplasmic domains (Honsho and Fujiki, 2001). No 
targeting signal receptor has been identified so far. A requirement for 
ATP and (unknown) cytosolic factors has been demonstrated (Just and 
Diestelkotter, 1996; Pause et al., 1997), but this is where the story ends 
for the time being. 


VIII. CONCLUSIONS 


In what sense is what we know of membrane protein assembly in vivo 
relevant for our general understanding of membrane protein structure? 
Except for the rather trivial fact that most membrane proteins have 
a unique orientation relative to the membrane and cannot reorient 
on the time scale of their typical lifetime—which suggests a nonequi- 
librium distribution resulting from the assembly process—it is unlikely 
that a membrane protein is not at thermodynamic equilibrium with the 
surrounding bilayer. From this point of view the membrane insertion 
process may be viewed as one part of the folding process: interesting 
from a basic chemistry point of view but not necessarily very important 
for understanding or predicting the 3D structure. 

However, it is already clear that the mechanisms responsible for mem- 
brane protein integration into the ER membrane and the inner bacte- 
rial membrane do place certain constraints on the “allowable” structures 
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and in some (rare?) instances even prevent thermodynamic equilibrium 
from being reached. Thus, it is hard to imagine that proteins with “frus- 
trated” topologies (i.e., with one or more strongly hydrophobic segments 
prevented from forming transmembrane helices by strongly charged 
flanking regions) represent equilibrium structures; rather, they most 
likely represent kinetically trapped states. 

Whether natural proteins with “frustrated” topologies exist is an open 
question, but it is likely that another constraint discussed above—the 
differences in the requirements for inducing a helical hairpin with a 
lumenal versus a cytoplasmic loop—has helped shape natural membrane 
proteins. This does not appear to be a very strong constraint, however, 
and will only become apparent when large numbers of 3D structures 
have been solved. 

Whether the transmembrane helices themselves are constrained by 
the assembly process is less clear. It is not unlikely that there are fine 
differences in the sequence requirements for insertion of a transmem- 
brane helix via a translocon compared with direct partitioning into a 
lipid bilayer, although no such comparative studies have been made 
(and they may be difficult to conduct). For instance, one can well 
imagine a situation where a marginally hydrophobic segment in a pro- 
tein may be sufficiently shielded from the surrounding lipid to simply 
pass through a translocon even though it could in principle form a 
stable transmembrane helix if given the opportunity. From this point 
of view, it would be interesting to compare the final structures of a 
given membrane protein when targeted either to the SecYEG-YidC or 
the Tat translocon in the E. coli inner membrane; they may well be 
different. 

To conclude, membrane protein assembly processes are both diverse 
and highly evolved, attesting to the fact that strongly hydrophobic seg- 
ments in proteins need to be guarded against and contained by tightly 
regulated processes in order not to wreak havoc on the cell. The lipid 
membrane may be a perfect shield against the environment, but “func- 
tionalizing” it with proteins is not an easy task! 
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I. INTRODUCTION 


Integral membrane proteins comprise roughly 30% of all proteins 
and are involved in many vital cellular processes such as environmental 
perception, cellular communication, and nutrient metabolism. Conse- 
quently, our understanding of cell biology will never be complete without 
a detailed understanding of membrane protein function and the inter- 
molecular forces that act in the bilayer. Although progress on membrane 
proteins has been slow because of many technical challenges, reports of 
new structures and investigations of folding and stability are becoming 
more frequent as we develop more experimentally accessible systems. In 
this review we present some of our current understanding of the struc- 
ture of a-helical membrane proteins and the forces that stabilize their 
structures. We focus on the folding and stability of membrane proteins 
within the bilayer rather than on the thermodyamics of helix insertion 
into the membrane. 
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II. TRANSMEMBRANE HELIX STRUCTURE 


Because transmembrane helices must span the membrane, they are 
longer on average than helices in soluble proteins, averaging about 
26 residues (Bowie, 1997b). The membrane-spanning requirement also 
limits the orientation of the helices in the bilayer. The greater the tilt 
angle (the angle between the helix axis and the membrane normal) the 
longer the helix needs to be to span the bilayer. Tilt angles range from 
0° to 40° in a fairly even distribution according to a survey of transmem- 
brane helices (Bowie, 1997b). When normalized for the probability of 
an observation (0° angles are much less probable than 40° angles in 
three dimensions), it is clear that transmembrane helices strongly pre- 
fer smaller angles. Indeed, single TM helices in the absence of tertiary 
interactions tend to align with the bilayer normal (Huschilt et al., 1989). 
These tilt angle restrictions have a significant effect on helix packing 
interactions (see below). 

Many TM helices are not ideal, straight, and regular, as is illustrated by 
the seven helices of rhodopsin (Palczewski et al., 2000) shown in Fig. 1. 
TM helices exhibit frequent distortions from ideality, including kinks 
and regions of x and äu helix. These distortions can cause the axis of 











Fic. 1. Curvature or kinks in rhodopsin helices. Individual helices from rhodopsin 
(1JGJ) (Palczewski et al., 2000) are shown in two perspectives in their order in the amino 
acid sequence. Lines are drawn approximately through the helix axes for reference. The 
amino acids shown are 1-26 (helix 1), 34-57 (helix 2), 71-90 (helix 3), 95-117 (helix 4), 
122-149 (helix 5), 155-185 (helix 6), and 190-217 (helix 7). 
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the helix to bend as much as 30° and lead to irregularities in side-chain 
placement (Riek et al., 2001). Oftentimes, these distortions are associ- 
ated with Pro residues. Pro is strongly destabilizing in the middle of a 
helix because ofsteric clashes and the loss ofa backbone hydrogen bond. 
Nevertheless, Pro is relatively common in TM helices, probably because 
of the high stability of helices in membranes (White and Wimley, 1999). 
In addition, weak C—H - -- O hydrogen bonds have been noted between 
the C; atom of the Pro side chain and the otherwise unsatisfied carbonyl 
oxygen of the preceding residue, which may help to minimize helix 
destabilization Chakrabarti and Chakrabarti, 1998). The presence of a 
Pro residue does not always lead to significant helix distortions, however, 
and about one-third of the observed helical distortions do not involve 
Pro (Riek et al., 2001). The cause of these nonproline distortions is not 
obvious and the extent to which the local sequence versus tertiary struc- 
ture defines the helical distortions is still not clear. These noncanonical 
helical regions may be functionally important, providing weak points 
in the helix that allow for conformational changes in the membrane 
(Jacob et al., 1999; Sansom and Weinstein, 2000). For example, Jacob 
et al. (1999) found that a GXXP motif in alamethecin was responsible 
for bending motions in the peptide. Interestingly, both the Pro and the 
preceding Gly were required for this flexibility, suggesting that it may 
be possible to define local sequence motifs that facilitate motion in trans- 
membrane domains. 


A. Transmembrane Helix Packing 


The range of helix packing angles in membrane proteins is more re- 
stricted than in soluble proteins and the distribution of helix packing 
angles shows a strong peak at around +20° (Fig. 2) (Bowie, 1997b). In 
fact, 62% of helix packings fall in the range from 0° to +40°. Part 
of the preference for these packing angles is due to the restrictions 
on tilt angles (noted above) that lead to a high probability of ob- 
serving small packing angles. It is not the only factor, however, since 
small positive packing angles are greatly favored over small negative 
angles. Small positive packing angles are likely to be favored because 
of side-chain packing constraints. Packing angles around +20° allow 
side-chain interdigitation without steric conflicts according to both the 
ridges-into-grooves or knobs-into-holes helix packing models (Crick, 
1953; Chothia et al., 1981; Walther et al., 1996). Thus, the +20° angle 
allows packing of long helices over much of their length without steric 
conflicts. Langosch and Heringa pointed out that contacting TM helices 
can exhibit a superhelical twist commonly observed in coiled-coil 
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Fic. 2. Helix packing angle (Q) distributions. Gray bars: The distribution of angles 
seen in 3498 helix packing interactions from soluble proteins. Black bars: The distribution 
seen in 88 helix interactions from the membrane proteins, bacteriorhodopsin, bovine 
cytochrome c oxidase, and photosynthetic reaction center (Bowie, 1997a). 


proteins (Langosch and Heringa, 1998). This superhelical bending 
allows helices to extend their area of contact where straight helices would 
diverge. 

The majority of helix packing interactions in membrane proteins 
are antiparallel, but this preference is greatly influenced by topological 
constraints. Transmembrane helices that are adjacent in sequence 
virtually always pack against each other and topological restrictions dic- 
tate that these be antiparallel. If one examines helix packings between 
protein subunits, which are largely free of topological constraints, the 
preference for antiparallel packing largely disappears (Bowie, 1997b). 
Although helix dipole interactions might be expected to favor antipar- 
allel packings, it must not be a strong factor. Calculations by Ben-Tal 
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and Honig suggest that helix dipole interactions largely disappear due 
to solvation once the helix emerges from the membrane (Ben-Tal and 
Honig, 1996). 


B. Residue Environment Preferences 


When the first soluble protein structures were revealed, it was imme- 
diately obvious that there was a striking preference for hydrophobic 
side chains in the interior and polar side chains on the surface (Perutz 
et al., 1965). This simple observation has had a powerful impact on 
our ability to predict structural features of soluble proteins (Eisenberg 
and McLachlan, 1986; Bowie et al., 1990, 1991). An examination of the 
first membrane protein structures found a much more subtle contrast 
between interior and surface residues. Rees et al. (1989a) found that 
the interiors of membrane proteins are about as hydrophobic as the 
interiors of soluble proteins, but the lipid facing residues were some- 
what more apolar. Thus, residue preferences for buried or surface en- 
vironments are much weaker for TM helices than for soluble proteins. 
A number of more recent surveys have noted a preference for small 
residues at helix interfaces (Javadpour et al., 1999; Jiang and Vakser, 
2000). Possible reasons for this preference include the smaller entropy 
cost for fixing small side chains on folding, or better packing by the 
smaller residues. Consistent with the latter possibility, several groups 
report better packing for small side chains than for large side chains 
in membrane proteins (Javadpour et al., 1999; Adamian and Liang, 
2001). The opposite is apparently true for soluble proteins (Adamian 
and Liang, 2001). The origin of this improved packing still remains to be 
explained. 

One of the most conspicuous features of residue distributions in 
membrane proteins is an “aromatic belt,” which sandwiches the apolar 
transmembrane domain. When aromatic residues are specifically 
displayed in membrane protein structures, a clustering in the interfacial 
region is often quite obvious. One example is shown in Fig. 3 (see 
color insert). Umschneider and Samsom (2001) have noted that this 
irregular distribution is restricted to Trp, His, and Tyr side chains, with 
Phe showing little preference for its location in the membrane. Thus, 
it appears that aromaticity and amphilicity are the preferred features. 
This aromatic belt reflects the favorable partitioning of aromatic amino 
acids into the interfacial region (Wimley and White, 1996; Yau et al., 
1998) and may act to stabilize the orientation of the entire membrane 
protein or individual helices in the bilayer. 
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HI. THERMODYNAMIC STUDIES 


A. Models 


Several models have emerged to describe the folding process of 
membrane proteins. In 1990, Engelman and co-workers proposed a 
two-stage folding model (Popot and Engelman, 1990) based on experi- 
mental results of bacteriorhodopsin fragments. [See Fig. 4A and Booth 
(2000).] In stage 1, the helices form and insert into the membrane. 
The unfavorable entropy of folding the backbone is overcome by the 
favorable partitioning of the hydrophobic side chains into the mem- 
brane. Deber and co-workers have defined a threshold hydrophobicity 
that, once surpassed, enables helix insertion into the bilayer (Liu et al., 
1996). Indeed, hydrophobicity is such a strong determinant of parti- 
tioning and consequent secondary structure formation that algorithms 
for predicting the TM segments are remarkably successful compared 
to secondary structure prediction algorithms in soluble proteins (Rost 
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Fic. 4. Thermodynamic models of membrane protein folding. (A) In the two-step 
model of folding, the TM helices fold and insert into the membrane and subsequently 
move laterally to associate (Popot and Engelman, 1990). (B) White and co-workers con- 
structed an expanded model, which includes the protein residing in water (states 1-3), 
the interfacial region (states 4 and 5), and the membrane core (states 6 and 7) (White 
and Wimley, 1999). 
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et al., 1996; Deber et al., 2001). In stage 2, the individual helices asso- 
ciate within the membrane to form the folded structure. Consistent with 
this model, some individual helices of bacteriorhodopsin were shown to 
insert into bilayers as isolated helices. The tertiary structure reformed 
on mixing the fragments with the complementary fragments containing 
the other helices (Popot et al., 1987; Kahn et al., 1992; Hunt et al., 1997; 
Marti, 1998; Luneberg et al., 1998). Thus, many TM segments appear to 
be independently stable and follow this two-stage folding model. 

The two-stage model is conceptually very useful because it separates 
the thermodynamics of insertion from helix assembly. When examining 
the stability of membrane proteins in the bilayer, the second stage is 
presumably the most relevant since the extrusion of the protein back into 
the aqueous environment should be extremely unfavorable. This idea is 
supported by the experiments discussed below. The two-stage model may 
not contain sufficient detail to describe the folding process in all cases, 
however. For example, the F and G helices of bacteriorhodopsin do not 
spontaneously form helices in vesicles, indicating that these peptides 
require assistance from the remainder of the protein to fold properly 
(Hunt et al., 1997). 

To develop a more complete thermodynamic framework for mem- 
brane protein folding, White and co-workers have created a three-stage 
and then a fourstage model of folding (White and Wimley, 1999). As 
shown in Fig. 4B, the model includes various folding and partition- 
ing steps. The protein may be unfolded (states 1 and 4), folded into 
noninteracting helices (states 2, 5, and 6), or fully folded with the 
TM helices packed together (states 3 and 7). These different confor- 
mations can exist in water (states 1-3), the interfacial region (states 4 
and 5), or the hydrocarbon region of the membrane (states 6 and 7). 
Cofactor binding to the packed helices and loop folding would be 
additional states not shown. Certainly, this model is a more complete 
formalism for discussing membrane protein stability. The experimental 
determination of the free energies involved is a Herculean task, how- 
ever. Many of the states are not significantly populated or are prone to 
aggregation. Nevertheless, a quantitative description of all the folding 
equilibria is the ultimate prize. 


B. Experiments 


Thermodynamic measurements will, in the end, provide the quanti- 
tative descriptions of the atomic interactions needed for computational 
predictions of membrane proteins. With membrane proteins, of course, 
these interactions must include all the interactions among the water, 
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its solutes, the membrane, and the protein itself. Unfortunately, true 
thermodynamic measurements are difficult with membrane proteins, 
because it is often difficult to find solution conditions that yield re- 
versible transitions. Usually we must rely on nonequilibrium measure- 
ments, such as the time required for inactivation at a given temperature, 
to judge the relative stability of different proteins. Furthermore, the sys- 
tems used are often complex and the nature of the denatured state is 
still poorly understood, limiting our ability to interpret the free ener- 
gies found in thermodynamic folding studies. Thus, many of our ideas 
concerning the forces that stabilize membrane proteins are based on 
theoretical arguments or qualitative comparisons. Ultimately, equilib- 
rium measurements of stability will be essential for validating theories 
and for developing a better understanding of the dominant forces in 
membrane protein stabilization. 

Measurement of unfolding free energies requires the ability to 
measure the relative populations of folded and unfolded proteins. This 
measurement is usually difficult under native conditions because only a 
tiny fraction of the protein is unfolded and this fraction is difficult to de- 
tect. Consequently, unfolding free energies are usually measured under 
conditions that produce marginal stability and the results are extrapo- 
lated back to conditions without denaturant. In membrane protein ex- 
periments, the protein stability has been modulated by urea, guanidine 
hydrochloride, pH, sodium dodecyl sulfate (SDS), and temperature. 
Reversible unfolding has been achieved in a few cases with guanidine 
hydrochloride, urea, and SDS, but not with temperature (Surrey and 
Jahnig, 1992; Klug et al., 1995; Lau and Bowie, 1997; Klug and Feix, 
1998; Chen and Gouaux, 1999). 

Thermal unfolding has been examined for a number of membrane 
proteins, including bacteriorhodopsin, cytochrome c oxidase, band 3, 
and photosystem II. These studies have been thoroughly reviewed by 
Haltia and Freire (1995), so we will simply highlight some of the 
general conclusions that seem to emerge: (1) Complete thermal dena- 
turation of the membrane domains is invariably irreversible. In the 
case of bacteriorhodopsin, several studies reported that the unfold- 
ing transition was scan rate independent, suggesting that thermody- 
namic analysis could still be applied (Galisteo and Sanchez-Ruiz, 1993; 
Shnyrov et al., 1994). This conclusion was subsequently disputed, 
however (Galisteo and Sanchez-Ruiz, 1993; Shnyrov et al., 1994). (2) Un- 
folding of the transmembrane domains involves minimal loss of he- 
lical secondary structure. For example, complete denaturation of bac- 
teriorhodopsin leads to loss of only about 15% of the helical content 
(Kahn et al., 1992) and in bacterial cytochrome c oxidase, a protein with 
large extramembrane domains, roughly two-thirds of the helical content 
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remains after complete denaturation (Haltia et al., 1994). (3) The de- 
naturation enthalpies are generally much lower than for soluble pro- 
teins. In fact, the denaturation enthalpies of membrane proteins can be 
largely accounted for by denaturation of the water-soluble portions only 
(Haltia and Freire, 1995). Overall, the results are consistent with the idea 
that the membrane-embedded portions of transmembrane proteins are 
more stable than the soluble portions. At high temperatures much of 
the transmembrane helical structure, and perhaps some of the tertiary 
interactions, remains intact. 

Equilibrium unfolding experiments using SDS as a denaturant have 
been performed on bacteriorhodopsin and diacylglycerol kinase (DGK) 
(Lau and Bowie, 1997; Chen and Gouaux, 1999). In these experiments, 
increasing concentrations of SDS are added to a protein solubilized in 
a nondenaturing detergent or lipid/detergent mixture. For both sys- 
tems the unfolding curves were largely reversible. In the case of DGK, 
the unfolding occurred within the detergent micelles since the un- 
folding curves depended on the mole fraction of SDS rather than its 
bulk concentration (Lau and Bowie, 1997). For bacteriorhodopsin, SDS 
denaturation exhibited a single unfolding transition, whereas for DGK 
two unfolding transitions were observed. The soluble domain of DGK 
unfolded at lower SDS concentrations than the transmembrane por- 
tions, suggesting that the transmembrane domain has higher over- 
all stability—a result consistent with the thermal denaturation studies. 
Extrapolations of the unfolding free energies to zero denaturant, 
with admittedly dubious theoretical justification, indicated stabilities of 
6 kcal/mol for the soluble domain and 16 kcal/mol for the membrane 
embedded domain at 25°C. Thus, the membrane-embedded domain 
appears to be at the high end of the typical stability range of soluble 
proteins, even in detergent solution. Presumably, the stability would be 
even higher in a bilayer environment since detergent solubilization is 
invariably destabilizing (Brouillette et al., 1989; Bowie, 2001). 

The nature of the unfolded state in denaturant and how it relates 
to the denatured state under native conditions in the bilayer is a ma- 
jor issue in all denaturation experiments. Thermodynamic arguments 
from the two-stage model suggest that the relevant denatured state has 
lost its tertiary structure and maintained the transmembrane helix sec- 
ondary structure. As noted above, CD spectra on thermally denatured 
bacteriorhodopsin suggest that the denatured protein maintains most 
of its helical secondary structure. The extent to which tertiary struc- 
ture is disrupted is unclear, however. It is possible that some stable in- 
terhelical interactions are maintained even at high temperature. The 
helical secondary structure content is also maintained in SDS micelles, 
and near-UV circular dichroism (CD) spectra suggest substantial loss or 
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rearrangement of the tertiary structure in diacylglycerol kinase (Zhou 
and Bowie, unpublished). Thus, the SDS unfolding may reasonably 
mimic the second step of the two-stage model and has the advantage 
of being reversible. Clearly, however, the complexities of detergent mix- 
ing in the protein-detergent complex may be difficult to separate from 
the protein stability. 

Measurement of dissociation constants of TM helix peptides is perhaps 
the most promising approach for evaluating the strengths of specific 
interactions. For example, equilibrium sedimentation and fluorescence 
quenching have been used to obtain dissociation constants for various 
TM peptides in detergent solutions (Fleming et al., 1997; Fisher et al., 
1999; Choma et al., 2000; Zhou et al., 2000, 2001; Fleming and Engel- 
man, 2001; Gratkowski et al., 2001). These systems offer the opportunity 
to compare the free energies of association for TM helix variants. The 
fact that the measurements must be made in detergents rather than in 
a lipid bilayer is a drawback. Fleming and Engelman (2001) showed, 
however, that the stabilities of a series of glycophorin A transmembrane 
helix mutants follow the same order in different environments, suggest- 
ing that relative stabilities may be environment independent. Thus, it 
may be possible to obtain a quantitative picture of the contributions of 
particular interactions in the bilayer from measurements in detergent. 
Naturally it would be ideal to obtain equilibrium measurements in a 
bilayer environment, but this is extremely challenging. In some cases 
with weak association, it has been possible to use fluorescence quench- 
ing to measure dissociation constants (Mall et al., 2001). Additionally, 
Isenbarger and Krebs (2001) have developed a method to measure the 
equilibrium stability of the bacteriorhodopsin lattice and have used it 
to measure the lattice stability of various mutants. These sorts of experi- 
ments, which quantitatively measure the association or folding reactions 
in different conditions, will allow us to understand the energetic contri- 
butions of the different forces that stabilize membrane proteins. 


IV. THE CONTRIBUTION OF LOOPS VERSUS TRANSMEMBRANE HELICES 


Is the manner in which transmembrane (TM) helices pack together 
dictated by the sequences contained within the membrane or by the 
extramembranous sequences, or “loops”? There is now considerable 
evidence that most loops are not essential in specifying the fold of mem- 
brane proteins. First, in many cases, the TM helices can encode con- 
siderable information for specifying the fold. Many single TM helices, 
such as the TM helix from glycophorin A, self-associate in the absence 
of their extramembranous domains (Lemmon et al., 1992a,b). Second, 
there are many examples in which the loops between TM helices in 
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membrane proteins can be clipped without losing the ability to fold and 
function. For example, Lac permease, when expressed as two comple- 
mentary halves in E. coli, reassembles into a functional permease (Bibi 
and Kaback, 1990). Two of the three loops between TM helices could be 
cleaved in the anion channel, CLC-1, without loss of activity (Schmidt- 
Rose and Jentsch, 1997). The band 3 anion exchanger in red blood cells 
can even be cleaved up to 4 times in the loops and still retain function 
(Groves et al., 1998). Moreover, a peptide of the first two helices of bac- 
teriorhodopsin can associate specifically with a fragment containing the 
remaining helices to reconstitute activity (Marti, 1998). Finally, the loops 
of bacteriorhodopsin can be shortened (Gilles-Gonzalez et al., 1991) or 
replaced with heterologous sequence (Allen et al., 2001; Kim et al., 2001) 
without the loss of function. 

Despite the fact that many loops are not critical for the organization 
of some TM helices, loops still play a stabilizing role. In the case of 
bacteriorhodopsin, clipping of the loops or replacing them with het- 
erologous sequences leads to a significant loss of stability. Katragadda 
et al. (2000) showed that the solution structure of three loop peptides 
from bacteriorhodopsin resemble the structure of the same sequence in 
the full-length structure. This finding suggests the sequences may have 
been optimized for a particular geometry. Moreover, like loops, some 
TM helices can be replaced with heterologous sequences without loss 
of function (Pohlschroder et al., 1996; Guzman et al., 1997; Zhou et al., 
1997). In diacylglycerol kinase, a variant in which the entire first TM 
helix is replaced with polyalanine still maintains activity within twofold 
of the wild-type protein (Zhou et al., 1997). This helix is nevertheless 
an integral part of the structure since sequence changes within this he- 
lix both stabilize (Zhou and Bowie, 2000) and destabilize the structure 
(Zhou et al., 1997) and key active site residues occur both N-terminaland 
C-terminal to it (Wen et al., 1996). This transmembrane helix must be 
largely passive in the sense that the sequence does not play an active role 
in defining the structure of the protein. Thus, while some TM helices 
encode structural specificity, other helices do not and the loop regions 
contribute significantly to the overall stability of membrane proteins. 


V. FORCES THAT STABILIZE TRANSMEMBRANE HELIX INTERACTIONS 


A. Lipid Interactions with Transmembrane Proteins 


Lipid bilayers are complex environments and the activity of membrane 
proteins can be modulated by the overall properties of the lipid bilayer, 
as well as by specific interactions with individual lipids. Overall bilayer 
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properties that can influence membrane protein stability include thick- 
ness, curvature, and rigidity. These properties can differ in the various 
cell compartments. Moreover, within a given compartment, different bi- 
layer environments can exist. Although the ideal fluid mosaic model of 
biological membranes includes the free diffusion of lipids and complete 
mixing of all the membrane components, the heterogenous mixing of 
certain lipids and the formation of lipid microdomains, or “rafts,” occur 
in the bilayer (Kurzchalia and Parton, 1999). Rafts rich in sphingolipids 
and cholesterol have been implicated in important biological functions 
such as protein sorting and cell signaling (Simons and Ikonen, 1997; 
Brown and London, 1998; Ikonen, 2001). Rafts can selectively include 
or exclude various proteins. For example, certain rafts attract glucosyl- 
phosphatidylinositol (GPI) anchored proteins (Hooper, 2001). The tar- 
geting of the raft-attached proteins could be directed by the destination 
of the raft. A growing body of literature supports the involvement of rafts 
in vesicle formation and protein sorting, such as the apical targeting of 
proteins in epithelial cells (Ikonen and Simons, 1998) and the sorting 
of prohormone convertase 2 (Blazquez et al., 2000). Preferential sorting 
of proteins into microdomains could have important concentration ef- 
fects in organizing membrane protein complexes (Mall et al., 2001). Re- 
ceptor clustering, an important step in signal transduction, sometimes 
occurs in lipid microdomains. The actual clustering of the receptor and 
the recruitment of other required proteins can enhance the signal pro- 
cessing. Rafts have been implicated in the signaling of various recep- 
tors, such as the FceRI receptor (Sheets et al., 1999), the T-cell receptor 
(Langlet et al., 2000), the EGF receptor (Waugh et al., 1999), and the 
ephrin-Bl receptor (Brueckner et al., 1999). The different bilayer en- 
vironments could also play important roles in modulating membrane 
protein structure and stability. In this section, we describe some of the 
ways different bilayer properties may influence membrane proteins. 


1. Hydrophobic Matching 


Hydrophobic matching occurs when the thickness ofthe hydrophobic 
surface of the lipid bilayer equals the hydrophobic thickness of a mem- 
brane protein. The stability of membrane proteins may be enhanced by 
hydrophobic matching or reduced if the hydrophobic region is either 
too long or too short, creating a “hydrophobic mismatch.” For example, 
the activity of diacylglycerol kinase (DGK) is dependent on the length 
of the hydrophobic tails of the bilayer (Pilot et al., 2001). Lipids with 
tail lengths of 18 carbons were found to enhance DGK activity over the 
activity seen in lipids of tail lengths 16 and 20. Maneri and Low (1988) 
found that the stability of Band3 increases with increasing hydrocarbon 
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Fic.5. Responses ofa TM protein to hydrophobic mismatch. The hydrophobic regions 
of a TM protein (black regions) may be too long for the lipid core, creating a mismatch. 
To help reduce this stress, the protein may change its tilt angle or undergo more favorable 
associations. The protein may associate with a specific lipid, with a different tail length 
or curvature, or with another protein to reduce the lipid-facing surface area. 


chain length. Hydrophobic matching could also play a role in determin- 
ing the tilt angle of transmembrane helices as shown in Fig. 5. Indeed, 
Williamson et al. (2002) found that the structure of KCSA adjusts to 
match the thickness of the bilayer in which it is placed. 

The hydrophobic matching principle can also lead to heteroge- 
nous mixing of lipids in biological membranes. The length of the 
hydrocarbon tails found in membrane lipids varies between 12 and 24 
carbons, potentially leading to a sizable ~100% change in the thickness 
of the hydrophobic region of the corresponding bilayers. If different 
lipid species cannot match their hydrophobic regions, a lipid might 
preferentially associate with its own type, forcing a partitioning into mi- 
crodomains (Lehtonen et al., 1996). Furthermore, membrane proteins 
may force anonrandom distribution of lipids in the bilayer. When bacte- 
riorhodopsin is in mixed lipid vesicles of dilauroylphosphatidylcholine 
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(DLPC) and distearoylphosphatidylcholine (DSPC), it preferentially 
associates with DLPC at low temperatures (Dumas et al., 1997). There- 
fore, in the presence of bacteriorhodopsin, the mixing of lipids within 
the bilayer is not complete as would be expected in the ideal fluid 
mosaic model. This preferential association of the protein with a certain 
lipid has been referred to as “molecular sorting” (Dumas et al., 1997). 

Hydrophobic matching is just one example in which a region of a 
molecule, that is, the protein, has chemical properties that match the 
properties of its surroundings, the lipids. With this view in mind, we 
should also consider how well the surface of the membrane protein 
matches the interfacial and the water-exposed regions of the bilayer. 
For example, aromatic residues have an affinity for the interface region 
(Wimley and White, 1996; Yau et al., 1998) and the positively charged 
residues have an affinity for the phosphate head groups. 

Despite our simple representations of the membrane, the borders 
separating the water, the interfacial regions, and the core regions are not 
clearly defined and discrete. A better description might be to view the 
positions of the chemical groups of the bilayer in distributions that have 
a certain mean position and a sizable width (Nagle and Tristram-Nagle, 
2000). These “fuzzy” boundaries between regions in the bilayers should 
give the bilayer some flexibility in trying to relieve the stress caused 
from mismatching chemical properties. Figure 5 illustrates how some 
membranes may adjust to relax the strain of a hydrophobic mismatch. 
The strain may lead to the tilting of the protein or the formation of 
protein-protein or lipid-protein associations. 


2. Lateral Capillary Forces 


A hydrophobic mismatch between a membrane protein and the sur- 
rounding lipids may create a lateral force that would pull membrane pro- 
teins together. A general theoretical description of this force, referred 
to as a “lateral capillary force,” has been presented by Kralchevsky and 
co-workers (Kralchevsky, 1997; Kralchevsky and Nagayama, 2000). Al- 
though experimental verification of this force for membrane proteins 
in a bilayer has not been demonstrated, the force can be observed in 
larger systems, such as 1.7 um latex beads at an air/water interface, 
and would be expected to operate on membrane proteins (Kralchevsky, 
1997). 

Basically, the lateral force originates from the deformation of the bi- 
layer in the presence of a protein with a hydrophobic region of different 
thickness than the bilayer. An example in which the hydrophobic re- 
gion of the protein is too long for the bilayer tails is illustrated in Fig. 6. 
(Similar arguments apply if the protein is too short for the lipid tails.) 
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Fic. 6. Lateral capillary forces can assist protein-protein associations. A meniscus is 
formed around each TM protein when the hydrophobic regions of the protein (black 
regions) are not the same length as the hydrophobic regions of the membrane. The 
unfavorable deformation can, in part, be relieved by the lateral association of two proteins, 
which reduces the surface area the proteins expose to lipids. 


The deformation in the bilayer, the meniscus, forms around the pro- 
tein to reduce the exposure of the hydrophobic region. If two protein 
molecules contact each other, the number of deformed lipid molecules 
would be reduced because of the reduction in the exposed surface area 
of the protein. 

The magnitude of this force for membrane proteins is unknown and is 
certain to be affected by various properties of the bilayer. First, the lateral 
capillary force diminishes as the mismatch becomes smaller. Second, the 
diagram in Fig. 6 presumes that the lipids do not have an inherent cur- 
vature. Many lipids have a tendency to curve around their head groups, 
however (see below). In the mismatch shown, the meniscus could actu- 
ally relieve the curvature stress created from the lipids tendency to curve. 
If the shape of the meniscus matched the curvature of the lipids, no 
lateral capillary force would be produced. On the other hand, the oppo- 
site hydrophobic mismatch, with the lipid tail being longer than the pro- 
tein, would exacerbate the curvature stress and produce a larger lateral 
capillary force. In this manner, the lipid curvature and the hydrophobic 
length both interact to determine the magnitude of the lateral capillary 
force for a given membrane protein. 


3. Curvature 


Bilayers are ideal for cylindrically shaped lipids, where the hydrocar- 
bon tails are the same size as the head groups (see Fig. 7). The tails of 
many lipids prefer to occupy a larger area, however, giving these lipids 
an inherent tendency to curve. The flattening of these lipids into a pla- 
nar bilayer causes a curvature stress, in which the center of the bilayer 
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Fic. 7. Schematic diagrams of different phases of amphiphiles. Amphiphiles, such as 
detergents with large head groups (gray) containing only one small hydrophobic chain 
(tail), can be thought of as cones that pack together to form spherical micelles. Bilayer- 
forming lipids are more cylindrical in shape and pack together into planes. The head 
groups of nonbilayer lipids occupy less area than the tails and therefore the lipids form 
inverted phases. 










is at higher pressure than the head groups. In some solutions contain- 
ing a single lipid component, such as cardiolipin or monogalactosyl- 
diacylglycerol (MGDG), the inherent curvature leads to the formation 
of other lipid phases, such as the inverted hexagonal Hy phase. These 
non-bilayer-forming lipids are present in biological membranes, but in 
these cases, the formation of the bilayer is maintained by the other lipids 
or membrane proteins. For example, the inverted hexagonal phase of 
MGDG, a thylakoid membrane lipid, is inhibited by the addition of the 
light-harvesting complex II, a major protein of the thylakoid membrane 
(Simidjiev et al., 2000). It was also shown that cytochrome c oxidase can 
help cardiolipin into a bilayer structure (Rietveld et al., 1987). On the 
other hand, the formation of nonlamellar phases was induced by the 
addition of a designed helical TM peptide (van der Wel et al., 2000). In 
this way, the interaction of the lipids and the TM proteins determines 
the phase behavior of the entire system. 

Curvature stress could play an important role not only in the lipid 
phase but also in membrane protein stability. Amembrane protein might 
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relieve the curvature stress if it is shaped like an hourglass, that is, its 
cross section in the center of the membrane is small compared to its 
cross section at the edges. Alternatively, an egg-shaped protein would 
exacerbate the stress. The importance of curvature was demonstrated 
in refolding experiments with bacteriorhodopsin, where changes in cur- 
vature stress influenced the protein yield on refolding (Curran et al., 
1999). 

A different type of curvature in bilayers can also result from the un- 
equal distribution of lipids between the two monolayers, with one mono- 
layer having more lipid molecules than the other. This curvature is differ- 
ent from the curvature stress described above in that it originates from 
the total number of lipids on each side of the membrane and not from 
an inherent curvature in each lipid molecule. This curvature could be 
important in the budding of lipid vesicles (Huttner and Zimmerberg, 
2001) and would be expected to influence and be influenced by the 
curvature stress described above. 


4. Specific Interactions 


Membrane proteins can also have specific interactions with individual 
lipids. Bound lipids have now been observed in various high-resolution 
crystal structures of membrane proteins. A particularly striking exam- 
ple is the 1.55 A structure of bacteriorhodopsin (PDB code 1C3W) 
with 14 lipids bound per monomer (Luecke et al., 1999). Specific in- 
teractions have been found both with the head groups and with the 
hydrocarbon tails. For example, in the bacteriorhodopsin and photo- 
synthetic reaction center structures, the hydrocarbon chains can be 
seen to follow grooves on the surface of the proteins (Luecke et al., 
1999; McAuley et al., 1999). In the bacterial cytochrome c oxidase, in- 
teractions with the head groups of two phosphatidylcholine lipids are 
stabilized by salt bridges to Arg side chains (Harrenga and Michel, 
1999). The conformation of the bound lipids does not always conform to 
bilayer geometry, underscoring the influence of membrane proteins on 
their surrounding lipids. 

Some of these bound lipids may play significant roles in stabilizing the 
folded structure of membrane proteins. Indeed, removal of cardiolipin 
from cytochrome c oxidase inactivates the enzyme (Abramovitch et al., 
1990). Specific lipid interactions may also play a role in the folding 
process. Dowhan and co-workers have shown that Lac permease will not 
fold properly in the absence of phosphatidylethanolamine (PE), but 
once folded, it does not require PE (Bogdanov and Dowhan, 1999). 
Thus, PE appears to possess the properties of a chaperone, perhaps 
stabilizing an obligatory folding intermediate. With the high-resolution 
structures in hand, it should now be possible to probe the contributions 
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of some of these lipid interactions using a combination of mutagenesis 
and lipid analogues. 

The entire thermodynamic system of the membrane and TM pro- 
tein must be considered to understand how the protein and bilayer 
achieve their native state. We have summarized four of the mecha- 
nisms, hydrophobic matching, tilt angles, and specific protein/lipid and 
protein/protein interactions that are important in determining the sta- 
bility (Fig. 5). Other important factors, such as the stability of lipid/lipid 
interactions, have been left out of our protein-centric view. We describe a 
hydrophobic mismatch as an unfavorable interaction that can be relieved 
by the other three processes, but we would expect all these properties 
of the system to interact. We could easily describe the same equilibria by 
saying that a strain in curvature is relieved by a hydrophobic mismatch 
or that strong protein/protein packing interactions might help relieve 
the hydrophobic mismatch or curvature stress. The complex interplay 
between all these interactions is at the heart of what determines mem- 
brane protein stability and will no doubt be difficult to quantify. 


B. van der Waals Interactions 


Any cursory look at the structures of membrane proteins will reveal 
that the side chains between neighboring helices make close contacts 
and interdigitate, implying that these van der Waals contacts stabilize 
the folded structure. The solution structure of the TM portions of gly- 
cophorin A (GpATM) suggested that van der Waals interactions may be 
a dominant force in membrane protein stabilization (MacKenzie et al., 
1997), because the structure revealed no traditional hydrogen bonds or 
charge/charge interactions. (See Section V, C on hydrogen bonding for 
an alternative possibility.) Glycophorin A dimerization occurs through a 
critical GxxxG motif in which the two Gly residues lie on the same side 
of the TM helix and allow the close approach and packing of the two 
helices. The GxxxG motif can lead to TM helix dimerization in a variety 
of sequence contexts. For example, the GxxxG motif is sufficient to in- 
duce dimerization of polyvaline and polymethione helices (Brosig and 
Langosch, 1998). Furthermore, in a library of random TM sequences 
that oligomerize, GxxxG was commonly observed (Russ and Engelman, 
1999, 2000). Interestingly, GxxxG was found to be statistically overrepre- 
sented in a database of TM segment sequences (Arkin and Brunger, 
1998; Senes et al., 2000). All of these lines of evidence demonstrate 
that the GxxxG motif is sufficient to cause TM helix association, even 
without the use of charge/charge interactions or traditional hydrogen 
bonds. 
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The interest in van der Waals forces as a primary determinant of TM 
helix association prompted studies on the efficiency of packing of mem- 
brane proteins. If van der Waals interactions dominate membrane pro- 
tein structures, it might be expected that membrane proteins would be 
very well packed together. Are membrane proteins better packed than 
soluble proteins, in which the hydrophobic effect can operate? At the 
moment the answer is not clear. In the earliest examination of mem- 
brane protein packing, Rees et al. (1989b) found the photosynthetic 
reaction center was about as well packed as soluble proteins. In a more 
recent examination, Adamian and Liang (2001) found that residues in 
TM helices are more likely to contact an empty space or “pocket” than 
helical residues in soluble proteins, suggesting that membrane proteins 
may be less well packed. In contrast, Eilers et al. (2000) compared the 
packing of helical residues in seven different helical membrane proteins 
to those of 37 soluble proteins and concluded that the TM helices were 
generally more tightly packed. The membrane proteins examined had 
an average packing value of 0.431 compared to 0.405 for the soluble 
proteins. The packing values of individual membrane proteins span a 
broad range, however, from 0.389 to 0.469, compared to 0.333 to 0.456 
for soluble proteins. Thus, if membrane proteins are better packed than 
soluble proteins, it appears the difference is modest. 

If membrane proteins are not dramatically better packed that soluble 
proteins, does this suggest that van der Waals interactions are not impor- 
tant? Certainly not. As noted early on in the study of proteins (Richards, 
1977), soluble proteins are very well packed, so there is perhaps little 
room for improvement. There is clearly a strong driving force in both 
environments to pack well. In both cases for van der Waals interactions to 
favor the folded state over an unfolded state, the protein must pack bet- 
ter with itself than with the solvent. Thus, well-packed proteins, making 
extensive van der Waals contacts, occur within and outside membrane 
bilayers. Even if TM helices are not more tightly packed than soluble 
helices, van der Waals forces probably play a stronger role in specifying 
TM helix association because of the reduced role of the hydrophobic 
effect and the paucity of charged residues. 


C. Hydrogen Bonding 


1. Traditional Hydrogen Bonds Involving Nitrogen and Oxygen 


Hydrogen bonds can be extremely stable in the membrane where 
there is a low dielectric and limited competition from water. The 
energetic cost of moving a non-hydrogen-bonded peptide unit into a 


38 CHAMBERLAIN ET AL. 


hydrophobic environment from water is much higher than the cost of 
burying the same peptide unit when it forms a hydrogen bond. For 
example, the transfer free energy of a non-hydrogen-bonded peptide 
unit from water into carbon tetrachloride is ~6 kcal/mol, whereas a 
hydrogen-bonded peptide unit costs only ~0.6 kcal/mol (see White 
and Wimley, 1998; White et al., 1998; White and Wimley, 1999 and ref- 
erences therein). These values imply that, within the membrane, the 
hydrogen-bonded peptide unit is more stable by ~5 kcal/mol over the 
non-hydrogen-bonded peptide unit. Thus, the cost of placing an unsatis- 
fied hydrogen bond in the bilayer is enormous, implying that essentially 
all polar groups in the membrane must be hydrogen bonded. The im- 
portance of satisfying potential hydrogen bonds is illustrated by Smith 
and co-workers. Their magic-angle spinning NMR experiments suggest 
the structure of the glycophorin A dimer in bilayers is slightly different 
than the micellar structure, allowing two threonine residues to hydrogen 
bond in the dimeric interface (Smith et al., 2001). 

Astriking demonstration of the strength of hydrogen bonding in mem- 
branes came from two separate efforts to design TM peptide oligomers 
based on the soluble GCN4 leucine zipper peptide (Choma et al., 2000; 
Zhou et al., 2000). In both studies, hydrophobic peptides were designed 
that preserved the residues buried in the GCN4 dimer, including a sin- 
gle Asn residue which hydrogen bonds across the GCN4 dimer inter- 
face. The designed peptides were found to oligomerize into dimers and 
trimers, but surprisingly, Asn was the only residue critical for oligomeriza- 
tion. In fact, a single Asn in a polyleucine sequence is sufficient to drive 
oligomerization. Subsequently both groups examined the oligomeriza- 
tion of TM helices with a series of amino acid substitutions at a single po- 
sition. Residues with two potential hydrogen bonding atoms (Asn, Asp, 
Gln, and Glu) were very effective in driving oligomerization, whereas 
other residues were not (Gratkowski et al., 2001; Zhou et al., 2001). 

Interhelical hydrogen bonds are fairly common in membrane protein 
structures. Adamian and Liang (2002) examined the known membrane 
protein structures and found that interacting helices that have interheli- 
cal hydrogen bonds make more intimate contacts than helices that inter- 
act without traditional hydrogen bonds. Serine was the most frequent 
amino acid in hydrogen bonds, followed by tyrosine, histidine, threo- 
nine, and arginine. Serine residues were frequently seen to be spaced 
by seven residues, creating “serine zippers” in analogy to the leucine 
zippers of coiled-coil peptides (O’Shea et al., 1991). 

The recent structure of the glycerol-conducting channel shows a 
unique feature that at first seems to undermine the importance of hydro- 
gen bonding in TM regions: two helices transverse only halfway through 
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the membrane (Fu et al., 2000). A helix that ends in the membrane 
core would normally leave unsatisfied backbone hydrogen bonds. The 
N-termini of these half-helices, M3 and M7, have conserved NPA (Asn- 
Pro-Ala) motifs, however, and come into contact in the center of the 
membrane to form a network of hydrogen bonds. The backbone of 
three turn residues makes hydrogen bonds with the Asn side chains 
in the NPA motifs. The OD1 atom of Asn-68 hydrogen bonds with the 
NH of Ala-70 (N---O distance = 2.93 A) as does the ND2 of Asn-203 
with the backbone O of Leu-67 (N---O distance = 2.72 A). There- 
fore, although the existence of helical termini in the TM region would 
not be expected from the energetics of hydrogen bond formation, the 
glycerol-conducting channel shows how a protein can use nonstandard 
conformations to satisfy the hydrogen-bonding potential. 

As pointed out by Zhou et al. (2001), the strength of hydrogen bond- 
ing in the bilayer presents a significant danger. Mutations that introduce 
polar side chains in the bilayer could distort structures or lead to the for- 
mation of inappropriate complexes in the drive to satisfy hydrogen bond- 
ing potential. Thus, polar residues may need to be carefully protected 
in membrane protein structures. Indeed, the neu oncogene, a mutant of 
the neu/erbB-2 tyrosine kinase receptor, bears a Val-to-Glu mutation in 
the transmembrane helix. The mutation is thought to enhance dimer- 
ization of the receptor leading to cell transformation (Weiner et al., 
1989). Smith et al. (1996) have shown that this Glu does indeed form a 
hydrogen bond in a bilayer. A second example occurs in the cystic fibro- 
sis transmembrane conductance regulator. Indirect evidence suggests 
that a point mutant causing a mild form of the disease, V232D, disrupts 
the structure through the formation of a hydrogen bond between the 
D232 and Q207 (Therien et al., 2001). These examples illustrate how a 
fundamental understanding of physical forces can lead to a molecular 
understanding of disease. 


2. Nontraditional Hydrogen Bonds 


CH ---O hydrogen bonds are weak hydrogen bonds that are known 
to influence small molecule energetics and are gaining acceptance as a 
force that stabilizes proteins. CH - -- O bonds form when an electronega- 
tive carbon atom (the donor) and an oxygen atom (the acceptor) share 
a hydrogen atom. Like all hydrogen bonds, they include dipole/dipole, 
monopole/monopole, and van der Waals interactions (Desiraju and 
Steiner, 1999). The strength of the hydrogen bond is proportional to 
the strength of the interacting dipoles making hydrogen bonds with 
oxygen and nitrogen atoms relatively strong. Weaker hydrogen bonds 
can be made with CH---O,CH---N, OH-::-z, or NH.. -x interactions 
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(x refers to a x electron cloud). In proteins, the strongest CH---O 
bonds occur with histidine Ce or glycine Ca atoms as hydrogen donors, 
because they are the most polar carbons. The dipole/dipole interac- 
tions make hydrogen bonds directional, as opposed to van der Waals 
interactions that are radially symmetric. CH... O hydrogen bonds, like 
all hydrogen bonds, can be identified by having two characteristics: 
(1) the proper geometric structure, and (2) an influence on the ener- 
getics, most often seen as a reduced C—H stretching frequency in the 
infrared or Raman spectra (Green, 1974). In proteins, hydrogen bonds 
must simply be identified by the geometry alone, because of the difficulty 
in isolating one specific donor-hydrogen resonance in IR spectroscopy. 

CH ---O bonds are well established in small molecule chemistry. In 
1937, CH - - - O bonds were first proposed to contribute to intermolecular 
interactions [see Green (1974)]. Ramachandran used CH ---O bonds 
to help explain the structures of polyglycine and collagen in the 1960s 
(Ramachandran and Sasisekharan, 1965; Ramachandran et al., 1966). A 
survey of 113 small-molecule crystal structures in the Cambridge Struc- 
tural Database (Taylor and Kennard, 1982) found various features that 
suggest the cohesiveness of CH---O interactions, including (1) a hy- 
drogen bound to a carbon has a statistical preference to associate with 
O over another C or H, (2) the H---O distance is often less than the 
sum of their van der Waals radii, and (3) the CH---O geometry is sim- 
ilar to the typical OH ---O hydrogen bond geometry. The enthalpy of 
formation of the CH---O bond in N,N-dimethylforamide dimers has 
been estimated with high-level, quantum-mechanical calculations to be 
a somewhat surprising —3.0 + 0.5 kcal/mol, or about half the strength of 
a NH ---O hydrogen bond (Vargas et al., 2000). This suggests that these 
interactions could be quite significant in stabilizing protein structures. 

Derewenda et al. (1995) analyzed the CH --- O contacts in 13 protein 
structures and found that a large proportion of short CH ---O contacts 
show geometric configurations typical of hydrogen bonds. Most of these 
close CH---O interactions involved Ha protons in ß-sheet structures, 
while short CH---O contacts were less common in helices. The same 
group has identified a CH---O bond with the histidine Ce in the ac- 
tive site of serine hydrolases (Derewenda et al., 1994). Meadows et al. 
(2000) describe a CH - - - O interaction in lysozyme with good geometry 
(distance H---O = 2.76A, angle CH---O = 158°). In addition, the Cô 
atom of proline can form a CH ---O bonds with a backbone oxygen to 
replace the NH---O bond in the backbone of a-helices (Chakrabarti 
and Chakrabarti, 1998). 

Although considered as “weak” hydrogen bonds, CH ---O bonds are 
more likely to have a significant energetic impact in the membrane with 
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its low dielectric constant. The prevalence of glycine in TM helix inter- 
faces has been noted (Javadpour et al., 1999; Russ and Engelman, 2000) 
and is often attributed to a lack of side-chain entropy costs upon fold- 
ing or the ability to faciliate closer, more intimate helix associations. 
Javadpour et al. (1999) have suggested that CH - - - O bonds may help ex- 
plain the predominance of glycine in helical interfaces, however, noting 
the polar interactions of Gly-420 and Gly-457 cytochrome c oxidase as an 
example. Senes et al. (2001) surveyed 11 structures of TM helix proteins 
specifically to assess the possible role of CH---O bonds in TM helix 
association. Their analysis identified potential CH - -- O bonds between 
Ca-H - - - O as well as CB-H---O and Cy-H---O. The ratio of the num- 
ber of CH: --O interactions involving Ca to the number involving Cf 
and Cy increases when the H---O distance is less than 2.7 A. This re- 
sult is consistent with the stronger dipole moment of the Ca—H atoms 
exerting a larger attractive force on the acceptor dipole. They identified 
networks of the CH - - - O bonds reminiscent of the traditional hydrogen 
bonding networks described by Adamian and Liang (2002). In particu- 
lar, as shown in Fig. 8 (see color insert), the GxxxG motif in glycophorin 
ATM forms three CH---O bonds, two involving the HA2 protons of 
glycine and one involving the HA proton of the valine residing in the 
first “x” position of the GxxxG motif. Of helical glycine residues, 23% 
were found to be CH---O bond donors and 10% were CH---O bond 
acceptors. Additionally, 24% of serine residues and 20% of threonine 
residues were involved in CH ---O bonds. These results imply that the 
use of CH - --O bonds strongly influences the association of TM helices, 
especially in the absence of the stronger, traditional hydrogen bonds 
with N and O donor atoms. Thus, it is possible that the relatively weak 
CH - -- O bonds play an important role, along with van der Waals and tra- 
ditional hydrogen bonds, in determining in the stability and specificity 
of TM associations. An experimental demonstration of the strength of 
these interactions is still needed, however, before we can assess their 
influence on membrane protein structures. 


D. Salt Bridges 


Because of the extremely large energetic cost of moving a charge 
from the high dielectric environment of water to the low dielectric of 
the membrane, it is unlikely that isolated, charged side chains would 
exist in the membrane. It is much more likely that they would exist in a 
neutral form (Smith et al., 1996) or be neutralized by interaction with an 
oppositely charged side chain, that is, form a salt bridge. To our knowl- 
edge there has not been a systematic study of salt bridges in membrane 
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proteins, but we would expect them to be relatively rare in TM helices 
because of the low abundance of charged residues (Senes et al., 2000). 
There is indirect evidence, however, for important salt-bridge interac- 
tions in membrane proteins. For example, an Asp and Lys in two TM 
helices of Lac permease appear to interact. The permease activity is re- 
tained if the positions of both the Asp and Lys are exchanged in the 
double mutant D237K/K358D, or if both charges are neutralized, as in 
the mutant D237C/K358C (King etal., 1991; Sahin-Toth et al., 1992). The 
individual point mutants, with only one charge neutralized, are inactive. 
Second, an Ala to Lys mutation, A19K, in the aspartate chemoreceptor, 
Tar, abolishes activity, while some second site suppressor mutations that 
could form an interhelical salt bridge restore activity (Unemura et al., 
1998). Other examples of possible TM salt bridges are found in the 
P;-linked antiport carrier (Hall and Maloney, 2001), gastrin-releasing 
peptide receptor (Donohue etal., 1999), ou, adrenergic receptor (Porter 
and Perez, 1999), cystic fibrosis transmembrane conductance regulator 
(Cotten and Welsh, 1999), and T-cell receptor assembly (Cosson et al., 
1991). Salt bridges have also been documented in side-chain interac- 
tions with lipid head groups. For example, in the bacterial cytochrome c 
oxidase structure, Arg side chains interact with the phosphate of a phos- 
phatidylcholine head group (Harrenga and Michel, 1999). To the ex- 
tent that salt bridges exist in membrane proteins, it seems likely that 
they would predominate in the interfacial region where they would be 
more accessible to lipid head groups and water. Nevertheless, more work 
is needed to better assess the role of salt bridges in membrane protein 
structures. 


VI. CONCLUSIONS 


Unlike the case of soluble proteins in which the hydrophobic effect 
dominates folding, it is still not possible to point to a force that domi- 
nates in the construction of helix bundle membrane proteins. Clearly 
hydrogen bonds can be extremely strong, but are relatively sparse and 
are not critical for the development of stable helix—helix interactions. 
Van der Waals interactions are certainly important, but the fact that a 
well-packed leucine zipper interface is not sufficient to drive TM he- 
lix association suggests that packing alone is not enough. Although the 
complex interplay between lipid structure and protein structure must 
play an important role, the fact that many membrane proteins remain 
folded and functional in detergent micelles suggests that lipid structure 
alone cannot entirely explain membrane protein architecture. Thus, 
it would seem that membrane protein structure is defined by a subtle 
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balancing of many small interactions. This is of course the worst possible 
situation for workers addressing the structure prediction problem as it 
may not be possible to focus on one aspect of stability. Certainly, we will 
need to develop a much better understanding of the molecular forces 
operating in the membrane environment. Perhaps our most acute need 
is a better understanding of lipid-protein interactions and weakly polar 
interactions in the membrane—a major challenge for the future. 
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I. INTRODUCTION 


After numerous complete genomic DNA sequences had been estab- 
lished and after these had been interpreted to a great extent in terms 
of protein sequences, it became evident that about 20% of all proteins 
are located in membranes (Liu and Rost, 2001). This number was de- 
rived from a search for transmembrane «-helices with a computerized 
prediction system, the results of which are known to come with a high 
confidence level. Such helices can be recognized from a continuous 
stretch of 20 to 30 nonpolar residues with a predominance of aliphatic 
side chains at the center and aromatic residues at both ends (Sipos and 
von Heijne, 1993). The number of transmembrane a-helices per protein 
is broadly distributed and averages around six. 

The main chain amides of these «-helices are all locally comple- 
mented, so that the interface to the nonpolar membrane interior is 
exclusively formed by the nonpolar side chains. This explains the use- 
fulness of an a-helix as a membrane-crossing element. The helix ori- 
entation can be deduced from the charge patterns of the interhelical 
segments, positive charges being inside and negative charges outside 
the cell. As a consequence the coarse structures, usually called topolo- 
gies, of all these a-helical membrane proteins can be assessed from the 
sequence. The presence of additional ß-sheets in these proteins is dis- 
cussed. However, these must be internal because it cannot be expected 
that the membrane is faced by a mixture of a-helices and f-sheets as 
the main chain hydrogen bond donors and acceptors at the sheet edges 
cannot be complemented by those of a-helices. 
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Because of their construction, transmembrane f-sheets have an amide 
saturation problem at their edge strands. Fusing both edges to form a 
barrel, however, permits complementation of all amide hydrogen bond 
donors and acceptors. Like a transmembrane helix, a ß-barrel can face 
the nonpolar interior of the membrane if its outer surface is coated 
with nonpolar side chains. Such barrels occur, indeed, in the outer 
membrane of gram-negative bacteria. They should be detectable in the 
sequence because every second residue is nonpolar. The significance 
of this information is low, however, because the individual $-strands 
are only slightly more than half a dozen residues long and the inter- 
mittent residues pointing to the barrel interior can be both polar and 
nonpolar. 

From the start the 6-barrels require cooperative folding of a polypep- 
tide chain of 100 or more residues which constitute an entropic hurdle. 
In contrast, an a-helical transmembrane protein can traverse the mem- 
brane as soon as a local segment of about 20 residues becomes nonpolar. 
The remaining transmembrane part can then be added piece by piece, 
which is entropically much more favorable. Therefore the -barrel mem- 
brane proteins arose probably rather late during protein structure evolu- 
tion, constituting an addition to the much simpler a@-helical membrane 
proteins. 

At present, transmembrane f-barrel proteins have been found exclu- 
sively in the outer membrane of gram-negative prokaryotes; these mem- 
branes seem to lack a-helical proteins. Accordingly, a separation exists 
between «-proteins in all cytoplasmic membranes and ß-proteins in the 
specialized outer membranes. Following the endosymbiotic hypothesis, 
B-proteins are also expected in the outer membranes of mitochondria 
and chloroplasts, but none of these proteins has yet been structurally elu- 
cidated. Given the limited abundance of such membranes, the ß-proteins 
are likely to make up only a small, special class of membrane proteins. 

In contrast to their limited importance in nature, the ß-barrel pro- 
teins are most prominent in the list of established membrane protein 
structures. Moreover, they show a high degree of internal chain-fold 
symmetry and therefore convey the impression of beautiful proteins 
(Fig. 1). One should not forget that the first protein structure, myo- 
globin, caused some disappointment among those who solved it as it 
showed no symmetry whatsoever; even the a-helices were not whole- 
numbered but about 3.6 residues per turn. Accordingly, the symmetric 
transmembrane f-barrels stand out from the bulk of asymmetric chain 
folds of water-soluble proteins. 

The number of distinct chain folds of integral membrane proteins 
is probably much smaller than the respective number for water-soluble 
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Fic. 1. Ribbon plot of the 16-stranded f-barrel of the general porin from 
Rhodobacter capsulatus viewed from the molecular threefold axis (Weiss and Schulz, 1992). 
Note the large variation of the inclination angle o and the difference between the high 
barrel wall facing the membrane and the low wall at the trimer interface. 


proteins, which ranges around a thousand (Schulz, 1981; Brenner and 
Levitt, 2000). Proteins of the cytoplasmic membrane consist mostly of 
transmembrane a-helices, and the bacterial outer membrane proteins 
contain ß-barrels. Both types show a high neighborhood correlation 
which limits the number of different topologies appreciably (Schulz 
and Schirmer, 1979). The a-helices run, in general, perpendicular to 
the membrane plane and connections are formed between neighboring 
helix ends (Bowie, 1999). All transmembrane f-barrels contain mean- 
dering all-next-neighbor antiparallel sheets, the topologies of which are 
completely described by the number of strands. 


II. STRUCTURES 


Here we discuss structures that have been established at the atomic 
level revealing the exact conformation of the polypeptide chain. All were 
determined by X-ray diffraction analysis of a threedimensional protein 
crystal. Some «-helical membrane protein structures have been ana- 
lyzed by electron diffraction of two-dimensional crystals, although gen- 
erally with a lower accuracy. For a long time structural analyses by NMR 
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TABLE I 
B-Barrels in Water-Soluble Proteins 








n S R(A) a(°) Secondary structure 
LH proteins 1 18 Left-handed £-helix 
Metalloprotease 1 18 — — Right-handed £-helix 
Pectate lyase 1 20 — — Right-handed £-helix 
Chymotrypsin 6 8 6.5 45 Symmetric antiparallel -sheet 
TIM 8 8 7.2 37 All-next-neighbor parallel -sheet 
Streptavidin 8 10 7.9 43 All-next-neighbor antiparallel 6-sheet 
Lipocalins 8 12 8.7 48 All-next-neighbor antiparallel 6-sheet 


had been considered inapplicable to membrane proteins, because these 
proteins can only be solubilized in large micelles. However, two small 
membrane proteins have now been tackled with this method, and suc- 
cess has been reported (Arora et al., 2001; Fernandez et al., 2001a,b). 
Moreover, the native structure of gramicidin A in the membrane was de- 
duced from solid-state NMR data (Ketchem et al., 1993). Other methods 
providing important structural information on proteins are light spec- 
troscopy, electron microscopy, and atomic force microscopy. However, 
none of them reaches atomic resolution. 

The existence of -barrels was established for chymotrypsin at a 
very early stage in the now common protein crystal structure analyses. 
This enzyme contains two distorted six-stranded f-barrels with iden- 
tical topologies (Birktoft and Blow, 1972). A selection of ß-barrels in 
water-soluble proteins is given in Table I. The very abundant TIM-barrel 
consisting of eight parallel -strands was also detected rather early 
(Banner et al., 1975). Additional eight-stranded f-barrels of this group 
are those of streptavidin (Hendrickson et al., 1989) and of the lipocalins 
(Newcomer et al., 1984). 

The £-helices also belong to this group as they can be taken as single- 
stranded £-barrels (n = 1) with large shear numbers of S = 18 and more 
(Table I, Fig. 2). They were first detected with pectate lyase (Yoder et al., 
1993). The right-handed and left-handed versions have positive and neg- 
ative S values, respectively. The cross-sections of these £-helical barrels 
deviate drastically from circles. The cross-section of the pectate lyase type 
resembles a boomerang and those of the metalloproteases a flat ellipse 
(Baumann ei al., 1993), whereas the LEH (left-handed £-helix) types of 
proteins form triangles (Raetz and Roderick, 1995). 

X-Ray diffraction analysis is a suitable and convenient method for 
obtaining exact structures of membrane proteins, but it requires 
three-dimensional crystals. Membrane protein crystallization has always 
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Fic. 2. General architecture of a ß-barrel. The description depends neither on the se- 
quence of the strands nor on their directions. Residues are represented by their Ca atoms. 
(A) Barrel geometry with the tilt angle o of the $-strands versus the barrel axis. The hy- 
drogen bonding pattern is shown for antiparallel 6-strands. (B) The barrel is cut where 
the first strand reaches the upper end, flattened out, and viewed from the outside. The 
sequence of the n strands along the circumference is arbitrary. The thin line follows a 
pleat of the sheet, that is, the hydrogen bonds. The shear number S is derived by running 
from a given strand to the left along the hydrogen bonds once around the barrel and 
counting the residue number S to the point of return to the same strand (Murzin et al., 
1994a,b; Liu, 1998). The depicted $-strand tilt corresponds to a positive S-value, and a 
tilt to the left to a negative S. 


been a bottleneck. Part of this obstacle is the preparation of suffi- 
cient homogeneous membrane protein material, because the limited 
volume of the two-dimensional entity membrane cannot incorporate 
large amounts of a recombinant protein. Moreover, any tampering with 
the membrane is highly hazardous for the respective organism so that 
high expression levels are generally rare. 

This problem was circumvented by expressing membrane proteins 
into the cytosol and (re)naturing it from there into micelles (Schmid 
et al., 1996), which is possible for a number of ß-barrel proteins. As a 
general observation the crystallization of the bacterial outer membrane 
proteins appears to be easier than that of the «-helical proteins from 
the plasma membrane. Accordingly, the list of the structurally estab- 
lished ß-barrel membrane proteins is comparatively long. It is given in 
Table II. The resolution of the analyses ranges from 1.6 A for OmpA 
(neglecting the nonnative gramicidin-A crystals) to 3.2 A for the porin 
OmpC (OmpK36). The crystals are usually loosely packed, except for 
one crystal form of OmpA that reached 50% (v/v) protein in the crystal 
but diffracted merely to medium resolution (Pautsch and Schulz, 2000). 
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TABLE II 
Transmembrane B-Barrel Proteins“ 





n S Bäi oer Oligomer Reference 
Gramicidin A 1 6 3.4 77 Dimer Ketchem et al., 1993 
(native) 
Gramicidin A 2 —6 Cs? fits — Dimer Wallace and 
(CsCl, CH3OH)‘ Ravikumar, 1988 
Gramicidin A 2 —4 narrow — Dimer Langs, 1988 
(C6He, ethanol)“ 
Nanotube 1 ‘8’ 5.6 90 Stack Ghadiri et al., 1994 
OmpX 8 8 7.2 37 Mono Vogt and Schulz, 
1999 
OmpA 8 10 7.9 43 Mono Pautsch and Schulz, 
1998 
OmpT 10 12 9.5 42 Mono Vandeputte-Rutten 
et al., 2001 
OmpLA! 12 12 10.6 37 Mono Snijder et al., 1999 
TolC’ 12 ‘20° 13.6 51 “Trimer” Koronakis et al., 2000 
a-Hemolysin’ 14 ‘14 12.3 37 “Hepta” Song et al., 1996 
Porin R. capsulatus 16 20 15.5 43 Trimer Weiss et al., 1990 
Porin OmpF 16 20 15.5 43 Trimer Cowan et al., 1992 
(PhoE, OmpC) 
Porin R. blasticus 16 20 15.5 43 Trimer Kreusch and Schulz, 
1994 
Porin P. denitrificans 16 20 15.5 43 Trimer Hirsch et al., 1997 
Porin Omp32 16 20 15.5 43 Trimer Zeth et al., 2000 
Maltoporin (two 18 20 17.1 40 Trimer Schirmer et al., 1995 
species) 
Sucrose porin 18 20 17.1 40 Trimer Forst et al., 1998 
FhuA 22 24 19.9 39 Mono Locher et al., 1998 
FepA 22 24 19.9 39 Mono Buchanan et al., 1999 


“ All sheets are antiparallel except for native gramicidin A. The topologies are always 
all-next-neighbor. 

P The radius is calculated for a circular cross section. The angle a can vary by +15° 
around the barrel. 

° Nonnative conformations of short peptides depend on the crystallization condition. 

“ This enzyme exists as a monomer in the membrane and becomes active on dimer- 
ization. 

° The barrel itself consists of parts from several subunits, violating the rule of one barrel 
per subunit. 





The first data showing a transmembrane barrel came from electron 
microscopy and electron diffraction of two-dimensional porin crystals 
(Jap, 1989). They followed spectroscopic studies of the amide bands in 
the infrared range indicating that the porin contains f-strands tilted 
at a 45° angle against the membrane plane (Nabredyk et al., 1988). As 


TRANSMEMBRANE ß-BARREL PROTEINS 53 


can be visualized in Fig. 2A, a 45° tilt in a barrel can point either to 
the right or to the left, the two alternatives being mirror images. This 
ambiguity is resolved, however, because only one of them corresponds 
to the energetically favorable and therefore commonly observed ß-sheet 
twist. In conjunction with the barrel diameter known from the electron 
diffraction data, these data would have sufficed for modeling the 6-barrel 
of the analyzed porin in a rather detailed manner. Shortly thereafter an 
X-ray analysis of three-dimensional porin crystals clarified this structure 
in atomic detail (Weiss et al., 1990). 

The pentadecameric antibiotic peptide gramicidin A forms channels 
through membranes that allow the passage of alkali ions. It has been 
included in Table II because it forms -barrels in two artificial con- 
formations as well as in its native conformation. The artificial con- 
formations were obtained by crystallizing from nonpolar solvents in 
the presence and absence of Cst ions (Wallace and Ravikumar, 1988; 
Langs, 1988). They showed very narrow ß-barrels, indicating that they 
present artifacts instead of channels. The real structure of grami- 
cidin A consists of a somewhat wider barrel in the membrane, which 
could only be determined by solid-state NMR of membranes contain- 
ing gramicidin A (Ketchem et al., 1993). It revealed a helical n= 1 
type of ß-barrel, two of which associate head-to-head forming a chan- 
nel through the membrane. Such narrow barrels can only be assumed 
if L-amino acid residues alternate with D-amino acids (or glycines) 
along the peptide chain, and this is here actually the case. The arti- 
ficial nanotubes listed in Table II follow the design of the gramicidin 
A channel except that the chain is an eight-membered ring instead 
of a ß-helix (Ghadiri et al., 1994). The rings are stacked forming a 
channel. 

The smallest established transmembrane f-barrel of the canoni- 
cal type is that of OmpX (Vogt and Schulz, 1999). It contains eight 
strands with a shear number S=8 and appears to represent the 
minimum construction for a transmembrane f-protein (Table II). In 
contrast to OmpX, the ubiquitous outer membrane protein A pos- 
sesses an N-terminal 171-residue domain (here called OmpA) as a mem- 
brane anchor and a C-terminal periplasmic domain binding to the 
peptidoglycan cell wall. OmpA contains an eight-stranded f-barrel with a 
shear number S = 10, which is larger than that of OmpX, giving rise to a 
larger barrel cross-section (Pautsch and Schulz, 1998). The relationship 
between strand numbers, shear numbers, and barrel radii is illustrated in 
Fig. 3. Applying point mutations, the OmpA crystals could be improved 
to diffract to 1.6 A resolution (Pautsch and Schulz, 2000). This allowed 
for an anisotropic structure refinement revealing the major mobility di- 
rections of loops and turns. It demonstrated that the turns and loops 
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Fic. 3. The observed f-barrels (no f-helices included) concentrate at tilt angles 
a between 35° and 50°. Because of the two-residue repeat in the hydrogen bond pattern 
(Fig. 2A), the shear number S is always even. The radius R of the barrel increases with 
the number of strands n as well as with the shear number S. Up to OmpLA (n= S = 12) 
the interior of the barrel can be filled by side chains and fixed water molecules. 


have asymmetric mobilities which correspond to those of a model in 
which the polypeptide is replaced by a resilient wire. 

The bacterial outer membranes also contain some enzymes, among 
them the protease OmpT consisting of a 10-stranded £-barrel with a 
shear number A = 12 (Vandeputte-Rutten etal., 2001). While the “lower” 
part of the f-barrel is immersed in the membrane as usual, its “upper” 
part protrudes into the external medium and contains the catalytic cen- 
ter where foreign proteins are split. OmpT is of medical interest be- 
cause it contributes to the pathogenicity of bacteria. A further surfacial 
enzyme is the phospholipase A (OmpLA) that destroys lipopolysaccha- 
rides. It consists of a 12-stranded £-barrel with a shear number S = 12 
(Snijder et al., 1999). Its barrel contains a solid interior hydrogen bond- 
ing network without a pore, a nonpolar outer surface, and the catalytic 
center at the external end. OmpLA is active as a dimer accommodated 
in the membrane. Interestingly, the dimer interface is for the most part 
nonpolar. 

The £-barrels of TolC (Koronakis et al., 2000) and of a-hemolysin 
(Song et al., 1996) are special because they are composed of several pieces 
coming from different subunits (Table I). The TolC barrel consists of 
three four-stranded all-next-neighbor antiparallel $-sheet pieces coming 
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from the three subunits. The major parts of the subunits are a-helical 
and not in the outer membrane. The shear number S is as large as 20, 
giving rise to a wide channel along the barrel axis (see Fig. 3). This is 
in contrast with the 12-stranded 6-barrel of OmpLA that has a smaller 
S and a solid core. The a-hemolysin barrel consists of seven £-hairpin 
loops coming from the seven subunits. Each subunit is water-soluble. 
The f-hairpin loop undergoes a large conformational change during 
the cooperative process of $-barrel formation, which is likely to occur 
on membrane insertion and after the large globular parts have formed 
an annular heptamer (Olson et al., 1999). 

The porins in the outer membrane of gram-negative bacteria are pas- 
sive channels showing various grades of selectivity. The most common 
type consists of 16-stranded £-barrels with a shear number S = 20. The 
structure of the porin from Rhodobacter capsulatus illustrated in Fig. 1 was 
the first transmembrane f-barrel structure to be established at atomic 
resolution (Weiss et al., 1990). It revealed the general construction 
principles (Schulz, 1992, 1994), which were subsequently also observed 
in other membrane proteins (e.g., the aromatic girdles), in other 
transmembrane f-barrels (e.g., the short periplasmic turns), and in 
other porins of this type (e.g., the transversal electric field for polarity 
separation). The structures of two channels that are highly selective for 
maltooligosaccharides and sucrose, respectively, showed 18-stranded 
barrels with kidney-shaped cross sections (Schirmer et al., 1995; Meyer 
et al., 1997; Forst et al., 1999). These cross sections deviate strongly 
from circles and allow long narrow channels which are required for the 
selection process. 

The largest -barrels have been observed with the monomeric iron 
transporter proteins FhuA and FepA. The structure of FhuA was estab- 
lished independently by two groups (Locher et al., 1998; Ferguson et al., 
1998). It is known with and without a ligated siderophore. The struc- 
ture of the ferric enterobactin receptor FepA is homologous to that of 
FhuA showing identical topology and a similar transport mechanism 
(Buchanan et al., 1999). In both cases there are more than 700 residues 
assembled in two domains: an N-terminal 150-residue domain is located 
inside a C-terminal 22-stranded £$-barrel with a shear number S = 24. 


III. CONSTRUCTION PRINCIPLES 


The general structural properties of 6-barrels are illustrated in Fig. 2. 
The barrel is sketched as a cylinder with circular cross section and 
all £-strands are assumed to run at the same inclination angle a. The 
-pleated sheet parameters a = 3.3 Å and b = 4.4 A refer to all kinds of 
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B-sheets: parallel, antiparallel, or mixed. The hydrogen bonds in Fig. 
2A are sketched for the predominant antiparallel sheet. When cutting 
the cylindrical barrel at the place where the first strand reaches the 
upper barrel end and flattening it out, the relationship between the 
number of strands n, the shear number S, and the tilt angle o becomes 
obvious: 


R= [(Sa)? + (nb)?]? Jar 
tana = Sa/nb 


R 


nb/27 cosa 


Negative values of the shear number S are observed in the extreme 
cases of left-handed £-helices (Table I) and gramicidin-A artifacts 
(Table II). In canonical -barrels S is positive and ranges between n and 
n + 4, allowing for an optimum £-sheet twist. TolC with S = n+ 8 is 
an exception. Furthermore, S is always an even number because after 
running around the barrel, ridges and valleys of the pleated sheet have 
to be joined to ridges and valleys again. In other words, the hydrogen 
bonds repeat only every second residue. 

The relationship between n, S, a and R in ß-barrels is shown graph- 
ically in Fig. 2B and in Fig. 3. The smallest barrel considered has six 
strands. Two strongly distorted copies of it were found in chymotrypsin 
(Table I). Eight-stranded $-barrels are more regular and much more 
common; the most prominent example is the TIM barrel. Among the 
water-soluble proteins a series of eight-stranded barrels exists with shear 
numbers ranging from 8 to 12 in TIM, streptavidin, and lipocalins, re- 
spectively. In all these cases the barrel interior contains a hydropho- 
bic core. In TIM barrels the active centers are invariably found at the 
carboxy-terminal end of the barrel. Streptavidin binds biotin at one bar- 
rel end. The same applies for lipocalins where the bound large nonpolar 
compounds reach down to the barrel center. The increasing ligand sizes 
and binding site depths from TIM to the lipocalins correspond to the 
increasing barrel radii caused by larger shear numbers. 

In contrast to the smaller 6-barrels of water-soluble proteins, those 
of membrane-inserted ß-barrels start at eight strands and run up to 22 
(Table II). Presumably, the required tightly packed nonpolar barrel core 
of the water-soluble proteins limits the radius of circular barrels to small 
values. In contrast, transmembrane barrels form polar cores the stabil- 
ity of which depends on hydrogen bonds rather than on geometrically 
exact nonpolar packing contacts. Such polar cores can be constructed 
much more leisurely and may also include water molecules that increase 
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the interior volume. Accordingly, the interiors of the barrels with up to 
12 strands are polar and solid, except for TolC with its exceptionally 
large shear number of 20. The large shear number gives rise to a large 
barrel radius (Fig. 3) which causes TolC to form a channel. The same 
applies for a-hemolysin, which, however, has a smaller shear number but 
two more ß-strands. 

Channels are of course also formed by all porins. A general porin con- 
tains 16 $-strands and has a shear number of 20 and a nearly circular 
cross section (Table II). Three parallel barrels associate to form trimers. 
The type of residues outlining the channel determines the specificity of 
a porin which, however, is usually not very strict. The two 18-stranded 
porins are very specific. Their channel cross-sections are actually smaller 
than those of the general porins in agreement with their higher selectiv- 
ity. The 22-stranded barrels of the iron transporter proteins have circular 
cross sections and would form a very wide channel if they were not filled 
with the globular N-terminal 150-residue domain. 

Apart from the construction principles dictated by the -barrel 
geometry and illustrated in Fig. 2, the transmembrane -barrels follow 
additional rules that are probably dictated by factors other than the co- 
valent peptide structure: 


1. Both the N- and C- termini are at the periplasmic barrel end re- 
stricting the strand number n to even values. 

2. All B-strands are antiparallel and locally connected to their next 
neighbors. 

3. On trimerization a nonpolar core is formed at the molecular three- 
fold axis of the porins so that the central part of the trimer resembles 
a water-soluble protein. 

4. The external ß-strand connections are long loops named L1, L2, 
etc., whereas the periplasmic strand connections are generally min- 
imum length turns named T1, T2, etc. 

5. Cutting the barrel as shown in Fig. 2 and placing the periplas- 
mic end at the bottom, the chain runs from the right to the 
left. 

6. In all porins the constriction at the barrel center is formed by an 
inserted long loop L3. 

7. The ß-barrel surface contacting the nonpolar membrane interior 
is coated with aliphatic side chains forming a nonpolar ribbon. The 
two rims of this ribbon are lined by girdles of aromatic side chains. 

8. The sequence variability in transmembrane ß-barrels is higher than 
in water-soluble proteins and exceptionally high in the external 
loops. 
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The first three rules are likely to reflect the folding process of the 
trimeric porins. Presumably, the central part folds in the periplasm like 
a water-soluble protein. The membrane-exposed parts of the barrels 
are then formed on insertion into the membrane. The short turns of 
rule 4 may facilitate barrel formation inside the membrane. Presumably 
rule 5 is a consequence of rule 4 because appropriate short turns can 
only be formed in one of the two possible chain directions. Rule 6 seems 
to reflect an early evolutionary event that has not yet been revised. The 
aromatic girdles of rule 7 were suggested to stabilize the ß-barrel and 
its vertical position in the membrane (Schulz, 1992). The stabilization 
has been confirmed experimentally by demonstrating the preference 
of aromatic compounds for the two nonpolar-polar transition regions 
of the membrane (Wimley and White, 1996; Yau et al., 1998). Rule 8 
came as a surprise to those with a high respect for membrane proteins 
which, of course, is mainly caused by our difficulties in solving membrane 
protein structures. Rule 8 explains these difficulties because it indicates 
that membrane proteins are subjected to fewer structural restraints than 
water-soluble ones and for this reason in general are more mobile and 
thus less crystallizable. 

Given the drastic mobility differences between the external loops and 
the membraneous and periplasmic moieties of the barrels, it was sug- 
gested that these proteins can be crystallized by creating suitable packing 
contacts through semirandom mutagenesis at loops and turns (Pautsch 
et al., 1999). Without structural knowledge, these loops and turns can 
often be predicted from the sequence and from interaction studies. For 
OmpA and OmpxX this approach resulted in surprising successes. The 
procedure is also applicable to a-helical membrane proteins inasmuch 
as their crystallization problem is governed by their small polar surfaces. 
It is quite possible, however, that they hesitate to crystallize because their 
transmembrane a-helices are stabilized by the native laminar membrane 
environment and become flexible in the detergent/lipid micelles used 
for crystallization. 

With so many rules, the prediction of transmembrane f-barrels from 
the sequence should be achievable at a high confidence level. However, 
the simple approach of looking for alternating polar and nonpolar 
residues inside and outside the barrel is not very helpful because 
this pattern is frequently broken by nonpolar residues on the inside. 
Moreover, the 6-strands are merely slightly more than half a dozen 
residues long which limits their information content appreciably. These 
problems have been tackled in several prediction programs (Welte 
et al., 1991; Schirmer and Cowan, 1993; Gromiha et al., 1997; Seshadri 
et al., 1998; Diederichs et al., 1998; Jacoboni et al., 2001) but cannot 
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be considered solved. For some time to come, the safety of ß-barrel 
prediction will remain well below that of transmembrane a-helices with 
their nonpolar 25-residue segments. 


IV. FUNCTIONS 


OmpX is synthesized in large amounts in stress situations and it is 
probably used as a defensive weapon binding to and thus interfering with 
foreign proteins. Half of the OmpX f-barrel protrudes into the external 
medium, presenting an inclined ß-sheet edge that binds to any foreign 
protein with a -strand in its surface layer (Vogt and Schulz, 1999). Such 
proteins are ubiquitous, an example being the large group of proteins 
with central parallel 6-sheets surrounded by a-helices. In X-ray analysis, 
all loop residues of OmpX were located in the electron density distribu- 
tion indicating that the -sheet edge presented to the foreign proteins 
is rigid, as it is required for tight binding. 

In contrast to OmpX the long external loops of OmpA are highly 
mobile and, on the whole, not visible in the respective electron den- 
sity map. Jn vivo, the mobile loops are rather resistant to proteolytic 
attack, presumably because they bind to the surrounding lipopolysac- 
charides. Obviously, the mobile loops fulfill essential functions in bacte- 
rial life (Morona et al., 1984). They are also known as docking points for 
bacteriophages. 

Among the outer membrane enzymes, OmpT isa special protease that 
has been implicated in the pathogenicity of bacteria. It is monomeric 
with the active center pointing to the outside (Vandeputte-Rutten et al., 
2001). Another enzyme, the phospholipase A OmpLA, produces holes in 
the outer membrane when it is activated. The activation process has not 
yet been clarified, but itis known to require a dimerization of OmpLA in 
the membrane. The activation by dimer formation has been verified by a 
crystal structure analysis of an OmpLA dimer that was produced by a re- 
action with an inhibitor (Snijder et al., 1999). It showed that each active 
center contained a catalytic triad Ser-His-Asn on one subunit and an ox- 
anion hole formed by an amide together with a hydrated Ca?" ion on the 
other. The active centers are well placed for deacylating lipopolysaccha- 
rides of the external leaflet of the outer bacterial membrane. OmpLA 
functions in the secretion of colicins and virulence factors. 

All general porins contain pores with sizes allowing the permeation 
of molecules up to molecular masses of about 600 Da (Nikaido, 1994). 
The pores come with various selectivities. The porin from Rhodobacter 
capsulatus, for instance, contains a rather nonpolar binding site near the 
external end of the pore eyelet, indicating that it may pick up molecules 
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such as adenosine at very low concentrations. The structure also revealed 
a transversal electric field across the pore eyelet that acts as a polarity sep- 
arator, excluding the unwanted nonpolar compounds (Schulz, 1992). 

Porin OmpF from Escherichia coli has been thoroughly analyzed by nu- 
merous groups and became the first membrane protein to form X-ray 
grade crystals (Garavito and Rosenbusch, 1980). It is closely homolo- 
gous to the porins PhoE (Cowan et al., 1992) and OmpC. These three 
porins show permeation properties adjusted to different environmental 
conditions. PhoE allows an efficient uptake of phosphate. OmpC from 
E. coli and its homologue OmpK36 from Klebsiella pneumoniae (Dutzler 
et al., 1999) are osmoporins expressed at high osmotic pressures usually 
caused by high salt concentrations. The high salt OmpC differs from 
the low salt OmpF mainly by an increased number of charged residues 
pointing into the pore lumen. Presumably, the charge increase counter- 
acts Debye-Hückel shielding at high ionic strength so that OmpF and 
OmpC (OmpK36) have comparable permeation properties in differing 
environments. Additional structures were established for the main porin 
from Paracoccus denitrificans (Hirsch et al., 1997) and for Omp32 from 
Comomonas acidovorans (Zeth et al., 2000). They confirmed the estab- 
lished general features of their homologues. 

All structurally established porins are aggregates of three parallel 
B-barrels, each of which contains a single polypeptide chain. The £- 
barrels contain either 16 or 18 strands. The interfaces are usually large 
and tightly packed. Therefore, it is not conceivable that the subunits are 
stable as monomers, either in the membrane or in the periplasm. How- 
ever, the existence of a functional monomeric porin has been reported 
(Conlan et al., 2000), but its structure has not yet been elucidated. 

With maltoporin the bacteria developed a particular small pore that is 
adapted to the amylose helix and accepts only glucose units (Schirmer 
et al., 1995; Meyer et al., 1997). The energetics of a maltooligosaccharide 
diffusing through such a pore has been examined in detail, revealing a 
combination of nonpolar and optimally spaced polar interactions that 
result in smooth gliding (Meyer and Schulz, 1997). This energy pro- 
file has been confirmed in a more detailed molecular dynamics study 
(Schirmer and Phale, 1999) and in an experimental study based on 
mutants (Dumas et al., 2000). The sucrose porin is a homologue of the 
maltoporin and very specific for the small molecule sucrose, in contrast 
with the maltoporin specific for larger oligomers (Forst et al., 1998). 

Besides porin trimers with 16- and 18-stranded £-barrels, even larger 
22-stranded -barrel proteins were found in the outer membrane, 
namely the monomeric active iron transporters FhuA and FepA. The 
lack of ATP or an equivalent energy carrier in the periplasm restricts the 
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outer membrane, in the first place, to more or less specific but passive 
pores that are not able to transport any solute against a concentration 
gradient. Bacteria overcame this problem by inventing plugged £-barrels 
and the TonB apparatus. 

The structure of the two evolutionarily related plugged pores FhuA 
and FepA are known. Their f-barrels have diameters of about 40 A 
(Table II). They bind siderophore-encapsulated iron in the external half 
of the barrel and are obstructed by an N-terminal 150-residue domain 
in the periplasmic barrel half. Mutational studies revealed their TonB 
binding sites. The directed iron transport through the outer membrane 
is energized by an interaction with TonB of the inner membrane that can 
draw energy from the cytosolic ATP pool (Rutz et al., 1992; Larsen et al., 
1999). The plug formed by the 150-residue domain is removed after 
binding to TonB, making the siderophore available for internalization. 

A completely different principle has been followed by the heptameric 
a-hemolysines (Song et al., 1996; Olson et al., 1999). These proteins as- 
sociate with their extramembrane domains. Subsequently, each subunit 
donates a ß-hairpin to form a common 14-stranded f-barrel through the 
membrane. In a similar manner TolC is assembled from three a-helical 
subunits (Koronakis et al., 2000). The subunits form a long, wide chan- 
nel that spans the periplasm in their a-helical part and that is prolonged 
through the outer membrane by the £-barrel. The channel is used for 
the export of xenobiotics. 

According to the endosymbiotic theory, the outer membrane of gram- 
negative bacteria corresponds to the outer membranes of mitochon- 
dria and chloroplasts (Reumann et al., 1999). All of them are porous 
and cannot hold an electric potential difference. One long-term can- 
didate for a porin homologue is the voltage-dependent anion channel 
(VDAC) of the outer mitochondrial membrane (Mannella, 1997). A fur- 
ther candidate for a 6-barrel channel in the outer mitochondrial mem- 
brane is Tom40, which contains 6-structure and forms a pore (Hill et al., 
1998). Its molecular mass would suggest a 6-barrel of the size of general 
porins. Unfortunately, none of these proteins has as yet yielded crystals 
suitable for structure analysis. Presumably, they are particularly difficult 
to crystallize because they face the soft cytosol, which does not require 
tough structures, in contrast to their bacterial counterparts that face the 
external medium demanding much higher stability. 


V. FOLDING AND STABILITY 


On first sight the folding process of -proteins appears to be much 
more complex than that of «-proteins. This does not apply, however, for 
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the all-next-neighbor antiparallel transmembrane f-barrels discussed 
here. For the porins it was suggested that the central part of the ho- 
motrimer including all N- and C-termini folds in the periplasm like a 
water-soluble protein so that the membrane-facing parts of the ß-barrels 
dangle as 200-residue loops into the solvent (Schulz, 1992). On mem- 
brane insertion, these loops can then easily meander forming the special 
B-sheet topology. The simplicity of the folding process is corroborated by 
the fact that porins and other transmembrane f-barrels such as OmpA 
(Pautsch and Schulz, 1998) can be (re)natured from inclusion bodies. 
This production method worked even for the large monomeric ß-barrel 
of FepA. 

In vivo, the folding process may be supported by a periplasmic chap- 
erone called Skp. Skp is a 17 kDa protein associated with the plasma 
membrane that, together with peptidyl prolyl isomerases and disulfide- 
exchanging enzymes, helps folding freshly synthesized proteins in the 
periplasm (Schäfer et ol. 1999). Skp binds to partially unfolded polypep- 
tides. Depending on the presence of phospholipids, lipopolysaccharides, 
and bivalent cations, Skp exists in two conformations, one of which is 
protease-sensitive (DeCock et al., 1999). Moreover, it was shown that 
Skp binds to unfolded periplasmic proteins and inserts into phospho- 
lipid monolayers, corroborating its putative role as helper in folding and 
membrane insertion. 

The stability of the 6-barrel itself was demonstrated in engineering 
experiments with OmpA. The four external loops of OmpA were re- 
placed by shortcuts in all possible combinations (Koebnik, 1999). The 
resulting deletion mutants lost their biological functions in bacterial 
F-conjugation and as bacteriophage receptors, but kept the transmem- 
brane ß-barrel as demonstrated by their resistance to proteolysis and 
thermal denaturation. The results confirm the expectation that the large 
external loops do not contribute to £-barrel folding and stability. 

Less dramatic changes were applied to a-hemolysin where the ß-strand 
sequence was altered by reversing the sequence within the £-hairpin 
contributed by each subunit to the -barrel (Cheley et al., 1999). With 
respect to the ß-barrel this changed only the hydrogen bonding pattern, 
but it reversed the sequence in the Gm at the end of the hairpin 
which should, therefore, have local conformational consequences. It 
turned out that the “retro”-barrel formed a channel but failed to invade 
erythrocytes. A high activity could be obtained, however, when the Bum 
was left in its original amino acid sequence, demonstrating that the tight 
P-turns at one end of these barrels are important for the stability of the 
whole barrel. Unfortunately, the detailed structures of the “retro”-barrels 
remained unknown. 
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The interiors of the three small ß-barrels of OmpA, OmpX, and 
OmpLA are filled with polar residues forming a hydrogen bonding 
network. A number of separated cavities mostly filled with water have 
been reported for OmpA and OmpX. Accordingly, these B-proteins are 
rigid inverse micelles and unlikely to form pores. For OmpA there was 
a lengthy discussion on the question of pore formation because small 
pores had actually been detected by ion fluxes (Sugawara and Nikaido, 
1994; Arora et al., 2000). It seems likely that these data were obtained 
with OmpA preparations that had lost the internal barrel structure dur- 
ing the protein purification process permitting channel formation. Such 
conformations should differ appreciably from the crystalline structure. 

Using an atomic force microscope (AFM) operated in the tapping 
mode, the surface structure of OmpF was analyzed under several condi- 
tions. For this purpose, two-dimensional crystals of the porin OmpF from 
E. coli were mounted on graphite or mica and exposed to large voltages 
and low pH as well as high ionic strength environments (Muller and 
Engel, 1999). When operating in the contact mode at minimum force 
the AFM showed a large conformational change in the external loops 
reminiscent of a structural collapse. The authors relate this to voltage 
closure. Unfortunately, the experimental setup is somewhat artifical and 
the actual voltage across the porin cannot be measured. However, the 
effects are obvious and certainly report a structural weakness in the pro- 
trusion formed by the external loops. Since the AFM works with living 
material and is able to detect vertical differences in the range of atomic 
bond lengths, it is likely to yield most interesting results in the future. 

A pore feature attracting continuous interest is voltage gating. This 
effect remains to be explained in detail. It is observed at comparatively 
high voltages across the membrane. Since the corresponding electric 
field strength is so high that it may disrupt hydrogen bonds, it seems 
likely that most of the in vitro voltage gating studies (Liu and Delcour, 
1998; Saxena et al., 1999) report nothing else than a structural break- 
down inside a pore. This does not apply to the voltage-dependent anion 
channel of the outer mitochondrial membrane (VDAC), however, which 
shows channel closure in vivo (Mannella, 1998). Presumably, VDAC con- 
tains a solid but separate mobile and charged domain that can be driven 
by the electric field onto the pore so that it prevents any further ion flow. 


VI. CHANNEL ENGINEERING 


The diffusion of small solutes through porins is passive. The diame- 
ters of the pore eyelets range from 10 A for the general porins to 6 A 
for the highly selective porins. Larger pores are usually decorated with 
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oppositely charged residues at opposite sides that form a local transversal 
electric field at the pore eyelet. This field constitutes an energy barrier 
for low-polarity solutes (Schulz, 1992) so that the bacterium can exclude 
the unwanted nonpolar molecules such as antibiotics despite presenting 
a spacious eyelet for picking up large polar molecules such as sugars. 

The engineering of porins became popular after it had been 
demonstrated that a mass-produced porin (re)natured from inclusion 
bodies had assumed exactly the native conformation (Schmid et al., 
1996). A systematic study changing the pore properties by mutations 
showed a strong correlation between eyelet cross section and diffusion 
rate (Schmid et al., 1998). Furthermore, a series of nine porins with 
mutations at the eyelet was analyzed with respect to ion conductance, 
ion selectivity, and voltage gating (Saxena et al., 1999). It was shown that 
charge reversals affect selectivity and voltage gating. Similar results were 
obtained with mutation at loop L3 of PhoE (VanGelder et al., 1997). In 
contrast to modifications at loop L3 inside the f-barrel, mutations at 
barrel wall residues lining the eyelet had only minor effects on voltage 
gating. This corroborates the suggestion that voltage gating reflects a 
structural breakdown in the pore. 

Sucrose porin has a somewhat larger pore than its homologue mal- 
toporin, the pore eyelet of which is closely adjusted to (1 — 4)-bound 
glucose units (Meyer and Schulz, 1997). With mutations at loop L3 the 
specificity of the sucrose porin was changed toward that of a maltoporin 
(Ulmke et al., 1999). The specificity change was achieved by introduc- 
ing three eyelet-defining residues of the maltoporin and by removing 
the additional N-terminal 70-residue domain of the sucrose porin, the 
structure of which is not yet known. 

The width of a passive channel can be monitored by ionic currents 
through a black-lipid membrane harboring the respective membrane 
protein. The channel may be clogged by organic molecules reducing 
the current over the residence time of such a molecule. This time as 
well as the amount of the current reduction is characteristic for the 
applied compound. Gu et al. (1999) used this principle by placing a cy- 
clodextrin as an adapter into the 14-stranded ß-barrel of an engineered 
a-hemolysin. Using this device they determined the current reductions 
and the residence times caused by a number of modified adamantans, 
demonstrating that they can measure these compounds in concentra- 
tions of around 10 uM and distinguish between various types. In a fur- 
ther study Braha et al. (1997) introduced a zinc ion binding site into the 
B-barrel of a-hemolysin. When occupied, this site changed the channel 
conductivity, so that this barrel could be used as a zinc ion sensor. These 
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are marvelous engineering examples, converting a ß-barrel pore to a 
molecular sensor with an electric output signal. 

In addition to the manifold possibilities of engineering on native 
ß-barrels, these can also be designed from scratch. In an ab initio 
approach Ghadiri et al. (1994) designed cyclic octapeptides and demon- 
strated that these assemble to socalled “nanotubes” forming channels 
through a membrane. The octapeptides consisted of alternating D- 
and L-amino acid residues and thus followed closely the construction 
principle of gramicidin A. In its native conformation gramicidin A forms 
a single strand $-barrel (n = 1) with a shear number S = 6 that may also 
be called a £-helix. The nanotubes are close to the same construction 
with n= 1 and a shear number S= ‘8’, but forming a ring instead of 
a B-helix. Stacking these rings across the membrane in a -barrel with 
an inclination angle o = 90° (Fig. 2A) then produces the channel. 


VII. CONCLUSIONS 


Although the major fraction of transmembrane proteins is «-helical, 
their ß-barrel counterparts have become popular because many of them 
could be analyzed in atomic detail and at high resolution. The transmem- 
brane ß-barrel proteins occur only in the outer membranes of bacteria, 
mitochondria, and chloroplasts. They assume astonishingly regular con- 
formations giving rise to numerous rules for their construction. It has 
been suggested that these regularities are required by the folding pro- 
cess. They are likely to permit the detection of transmembrane f-barrels 
from the sequence at a reasonable confidence level in the future. 

Whereas water-soluble proteins contain regular 6-barrels with up to 
eight strands and nonpolar interiors, transmembrane ß-barrels contain 
eight or more strands and have polar interiors. The smaller transmem- 
brane ß-barrels have solid cores partially filled with water. Accordingly, 
they can be considered inverse micelles. The larger barrels of this type 
have channels along their axis that allow for the permeation of var- 
ious types of solutes. The largest known 22-stranded transmembrane 
f-barrels are used for the active transport of rare commodities through 
the bacterial outer membrane. Their interior contains a globular protein 
domain which functions as a plug that can be removed for transport. 

The success rate for engineering transmembrane f-barrels appears 
to be superior to that for soluble proteins. On one hand, these barrels 
can be mass produced into inclusion bodies and (re)natured therefrom 
in vitro. On the other hand, the interiors of the -barrels housing the 
permeation channels can be mutated with little effect on the barrel 
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construction. The instabilities usually introduced with amino acid 
residue changes can be compensated for to a large degree by the 
surrounding ß-barrel. This allows the application of a rather broad 
spectrum of mutations without destroying the protein construction as 
with water-soluble proteins in which mutations are frequently punished 
by deterioration. Taking advantage of this situation, the 6-barrels have 
already been engineered to sensors for organic molecules and metal 
ions at micromolar concentrations, which, in the future, may lead to 
useful technical devices. 
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I. INTRODUCTION 


Here we reflect on the functional design, engineering, and construc- 
tion of the proteins engaged in the harvesting of light energy and trans- 
forming it into membrane electrochemical gradients and the oxidants 
and reductants that provide energy and material substrates for the cell. 
We focus our attention on light harvesting (LH) proteins and on re- 
action center (RC) proteins of photosynthetic bacteria, cyanobacteria, 
algae, and higher plants. Useful perspective comes from comparing pho- 
tosystem design and engineering with the analogous membrane redox 
proteins of respiration. 
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Figure 1 (see color insert) describes a relay of membrane proteins that 
first transfers light energy captured by the network of the LH proteins 
to an RC protein, which then catalyzes electron tunneling-mediated 
charge separation and transmembrane electric potential generation and 
substrate oxidation and reduction. Redox connection of the RC with 
neighboring intrinsic and peripheral membrane proteins is then medi- 
ated through molecular diffusion of the substrates and products. These 
in turn provide the energy for the oxidoreductases cytochrome bo or 
bef, which, like the RC, promote transmembrane charge separation and 
the buildup of an electrochemical potential gradient of protons. This 
electrochemical potential gradient ultimately provides the driving force 
for the majority of biochemical reactions in the cell, including the dark 
reactions of photosynthesis. 

Although the proteins of photosynthetic machinery are large and com- 
plex, it appears that the engineering that has been favored by blind nat- 
ural selection is comparatively simple and resilient and does not require 
an atom-by-atom examination to appreciate its design. Instead we can 
begin by considering how nature has worked within the time constraints 
imposed by various decay processes by exploiting the distinctive length 
scales that are associated with each stage of the relay (Fig. 2, see color 
insert). We hypothesize that the membrane proteins of the relay evolved 
with a strong selection on the simple distance between the cofactors. 
We contend that distance selection is dominant in providing photosyn- 
thetic and respiratory energy conversion with robust foundations that 
accommodate extensive structural and energetic tolerances. We further 
expect these tolerances within simple, robust frameworks have fostered 
the broad palette of evolutionary variance evident in the proteins of mi- 
croorganisms, animals, and plants and have enabled them to generally 
operate well away from the boundaries of failure and pathogenesis. This 
relative simplicity and robustness also promises to be translatable to, and 
exploitable in, synthetic systems of energy conversion. 


II. OVERVIEW OF LENGTH SCALES IN BIOENERGETIC MEMBRANES 


The physical process of energy conversion describes a few basic length 
scales that determine the general design of photosynthetic proteins. 
Electron tunneling defines the first and most conspicuous length scale in 
redox proteins that couple the early photosynthetic events of light energy 
transfer to the later proton transfer, redox catalysis, and diffusive motion 
reactions (Fig. 2). Indeed, RCs provided the first demonstration of elec- 
tron tunneling in biology in the early 1960s (Devault and Chance, 1966) 
and have continued to provide much of the now extensive biological 
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electron tunneling information in the years since. There isa tremendous 
experimental advantage in the natural ability to activate LH and RC pho- 
tosystems with single-turnover flashes and time-resolve both donor oxida- 
tion and acceptor reduction from tens ofseconds to tens offemtoseconds 
at temperatures from 300 Kto 1 K. Furthermore, RCs have demonstrated 
the stability and the molecular biological and biochemical flexibility 
needed to alter structure and free energies of individual tunneling re- 
actions for systematic studies, backed by the vital high-resolution struc- 
tures. As a result, simple guidelines have emerged from the work on 
RCs that are now proving useful in the understanding of tunneling pro- 
cesses in other redox proteins, extending our insights to the tunneling 
engineering guidelines of electron transfer proteins in general. 

The free energy (AG) dependence of the electron-tunneling rate 
gives rise to a second length scale, this time an energy/length scale. 
As shown in Fig. 2B, for any characteristic time and free energy, there is 
a maximum distance over which tunneling can extend. This is true for 
both the traditional exergonic electron transfers and the endergonic 
or uphill electron transfers (Page et al., 1999). Electron transfer to an 
energetically unfavorable redox intermediate can take place within a 
given time as long the uphill electron transfer is followed eventually 
by a favorable downhill electron transfer. Indeed, endergonic electron 
transfers through intermediates hundreds of meV uphill can take place 
faster than typical enzymatic turnover times provided the redox centers 
are closer than 12 A. The smaller the distance, the more energetically 
unfavorable the electron transfer intermediate can be. 

Transmembrane electric field strengths provide a related length scale 
in these membrane-bound proteins, since the energetics of electron 
transfer are dependent on the length of the electron transfer along the 
membrane normal. Light-driven electron tunneling in RCs is coupled to 
electronic charge separation across the ~35 A of the membrane low di- 
electric profile. The initial picosecond charge separation in RCs occurs 
within the several chlorophylls (Chl) or bacteriochlorophylls (BChl) of 
the RCs and is mostly parallel to the membrane. In the subsequent elec- 
tron transfers, the electron and hole move in opposite directions to 
span the supporting membranes and generate a transmembrane elec- 
tric potential (AW) that together with the transmembrane pH gradient 
(ApH) is an essential component of the electrochemical proton gradi- 
ent (AuH*). A similar pattern is followed in many of the companion 
oxidoreductases in which the principal catalytic event involves mostly 
lateral electron transfer, succeeded by transmembrane electric field gen- 
erating and proton moving steps. In the photosynthetic and respiratory 
machinery the longest transmembrane transfers contribute the most to 
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AuH* and are also the most susceptible to inhibition by preexisting 
electric fields. In contrast, the lateral catalytic reactions are relatively 
unaffected by transmembrane fields. 

The magnitude of the prevailing AuH*, as well as the proportions of 
the ApH and AW to AuHt, varies from species to species (Dutton et al., 
2000). Across the 35 A low dielectric region, bacteria develop more than 
250 mV of electric field and 0.5 units of ApH. Mitochondrial membrane 
fields are a bit less at ~170 mV, with a similar pH drop, while chloroplasts 
may have little transmembrane field, and a much larger proton concen- 
tration gradient, up to 4 pH units. Thus bacterial reaction centers and 
oxidoreductases have to face an electric field of nearly 10° V/m when 
moving charges across the membrane. This can be enough to partially in- 
hibit the bq complex (Shinkarev et al., 2001). On the other hand, the mi- 
nor electric fields sensed by plant PSI, PSII, and be f means that the trans- 
membrane electron transfers themselves are not inhibited. However, the 
large ApH will eventually inhibit the proton-exchanging quinone and 
water/oxygen redox reactions and slow the turnover of the system. 

A third length scale is defined by the dependence of energy trans- 
fer upon the distance between chromophores. As shown in Fig. 1, light 
capture occurs predominantly by chlorophylls, carotenoids, and other 
pigments associated with LH proteins that transfer their energy to a re- 
action center (Cogdell et al., 1999). Light is also captured directly by the 
pigments of the RC protein itself. However, RCs differ sharply from LHs 
by being equipped and engineered to couple absorbed light energy to 
electron tunneling and for the production of strong but stable oxidants 
and reductants. The length scales for energy transfer efficiency and elec- 
tron tunneling in Fig. 2 are sharply different. The 1/r? or 1/r® decay 
of energy transfer efficiency, where r is the distance between cofactors, 
is less severe than the exponential decay of the tunneling rate with dis- 
tance (Sundstrom et al., 1999). However, the ultimate engineering limit 
of light energy transfer is the decay process of fluorescence, shown as 
the purple band in Fig. 2A. The need to complete energy transfer before 
fluorescence decay is one of two principal time constraints on the range 
of permissible intercofactor distances in these bioenergetic proteins. 

Diffusion provides a fourth distance scale. Each of the protein 
complexes generates diffusible oxidants and reductants in the differ- 
ent compartments defined by the bioenergetic membranes. Thus, the 
PSI RC of cyanobacteria, algae, and plants deliver the strong stable re- 
ductant NADPH to the cytoplasm or chloroplast stroma, where it is used 
for biosynthesis. The corresponding strong oxidant dioxygen is released 
from the luminal side of the PSII RC and diffuses away to be consumed 
in respiratory processes within the cell or expelled from the organism as 
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a by-product. Other oxidants and reductants produced by the different 
RCs are used locally as substrates with a redox potential difference (A Eh) 
that drives further, respiratory-type electron transfer in cytochromes Ae 
and bsf. Thus, RCs from purple bacteria produce hydroquinones in the 
membrane and ferricytochrome c in the periplasm that diffuse away 
to become substrates to the cytochrome bc; which regenerates ferrocy- 
tochrome c and quinone to complete a cycle. In plants, PSII and PSI 
produce the analogous hydroquinone and oxidized plastocyanin or cy- 
tochrome ce as substrates for the cytochromes bęf in an overall linear 
electron transfer chain. 

The distances that are covered by substrates and products to reach 
the next membrane protein are considerable. Ubiquinone diffuses tens 
to hundreds of angstroms to connect RC and bo in bacteria, while plas- 
toquinone diffuses even further between PSII and b¢f in chloroplasts. 
Figure 1 shows some approximations of diffusion of small organic ox- 
idants and reductants such as NAD*+/NADH and redox proteins such 
as cytochrome c, plastocyanin, or ferredoxin. The diffusion of small 
molecules is sufficiently fast that distances greater than 5 A can be cov- 
ered in less time than a typical electron tunneling reaction with relatively 
small driving force. For larger molecules, such as cytochrome c and 
ubiquinone, diffusion is faster than typical electron tunneling reactions 
over distances greater than about 14 A. Although there is little control 
over which direction the molecules diffuse in order to encounter their re- 
action partner, this difficulty is largely overcome by using pools of diffus- 
ing agents, such as the ubiquinone and plastoquinones in the membrane 
quinone pools, and the smaller pools of water-soluble cytochrome c or 
plastocyanin attracted to the membrane surface. At any rate, diffusion 
is fast enough to keep up with the rates of catalytic turnover (kcat) of 
oxidoreductases of roughly 10° s~! (Gupte et al., 1984). 


III. MANAGING LENGTHS IN NATURAL REDOX PROTEIN DESIGN 


The time constraints required for competent bioenergetic function 
have influenced the geometry of cofactor assembly in photosynthetic 
proteins. Where time constraints are severe, distances will be confined. 
On the other hand, where time constraints are relaxed and there are 
conspicuous gaps between the characteristic times of different physical 
processes (Fig. 2), distances and other parameters such as free energy 
can be more relaxed. This relaxed situation allows robust operation of 
the redox protein, one well away from the limits of failure rather than 
a highly optimized but fragile design. This condition has apparently 
been favored repeatedly by natural selection. Thus light energy transfer 
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is generally faster than fluorescence decay times for distances smaller 
than about 40 A, and is also comfortably faster than electron tunneling 
at any distance. Electron tunneling in turn outstrips diffusion of modest- 
sized redox agents for distances smaller than about 12 Ä. However, be- 
yond these limits, diffusional considerations dominate the movement of 
oxidizing and reducing equivalents. Even so all of these processes are 
generally much faster than the natural rate-limiting steps of the enzy- 
matic activities that terminate all photosynthetic and respiratory electron 
transfer chains (Suarez et al., 2000). This fundamental time constraint 
apparently reflects the typical time and energy required to arrange the 
catalytic site into a catalytic intermediate state, as described by Michaelis 
(1951). By examining the structures of many natural electron-transfer 
proteins, it seems clear that nature has ensured that electron tunneling 
is faster than keat by keeping the tunneling distances between cofactors 
shorter than about 14-15 A (Page et al., 1999). 

Electron tunneling over transmembrane dimensions is prohibitively 
slow relative to typical kcat times. Instead, membrane proteins use the 
simple device of chains of redox cofactors. Figure 1 shows that each 
of the RC and bc or bgf membrane proteins accommodates a large 
number of redox cofactors obviously lined up in chains. This is true even 
for other membrane oxidoreductases with “surplus” cofactors that for 
many years were considered supernumerary. Consecutive tunneling over 
several short steps replaces the exponentially slower tunneling rates at 
long distances with an approximate linear dependence of rate decrease 
on distance tunneled. We note that the chains are largely arranged to 
promote tunneling across the membrane and beyond into the aqueous 
phase for distances up to 100 A, while diffusive connections are largely 
lateral to the membrane. 

Figure 1 illustrates that the overall design of the different transmem- 
brane RCs and cytochrome bc, structures are remarkably similar. In 
each case the site of primary energy conversion, be it the array of re- 
dox (B)Chls in the RCs or the hydroquinone oxidizing Ou site of the 
cytochromes bc), comprises a cluster of redox centers separated by rela- 
tively short 4.5 + 1 A gaps and is located close to one membrane aqueous 
interface. In each case the electron transfer integral to the primary en- 
ergy conversion yields a charge-separated state oriented almost parallel 
with the membrane plane and is itself not significantly electrogenic. In 
each case the initial charge separation is then propagated across the 
membrane profile through a series of electron transfers over somewhat 
larger distances of 8 + 4 A. The job is distributed between four or five 
redox cofactors that extend from one side of the membrane to the other. 
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Because nuclei are so much more massive than electrons, nuclear 
tunneling is much slower than electron tunneling at any distance. Co- 
factors must be in near contact for direct hydrogen or hydride tunnel- 
ing over the few tenths of an angstrom that has been invoked in two and 
multiple-electron transfer in the catalytic mechanisms of flavin, quinone, 
and NAD (P), and in water oxidation and oxygen reduction. On the other 
hand, at the shortest distances even highly endergonic electron tunnel- 
ing might be a faster alternative than direct hydride tunneling provided 
there are other proton donors and acceptors in the vicinity. For example, 
two one-electron transfers through a 0.6 eV (~14 kcal/mol) endergonic 
radical intermediate state can be as fast as a millisecond reaction even 
when the overall driving force for the two-electron reaction is zero (Page 
et al., 1999). 

Advances in structural work now permit us to consider the natural 
distribution of distances between cofactors in photosynthetic proteins. 
The X-ray crystal structures of RCs from Rhodopseudomonas viridis (Michel 
et al., 1986) and Rhodobacter sphaeroides (Ermler et al., 1994), species that 
had already been the subjects of 20 years of intensive investigation, pro- 
vided the first precise tunneling distances between the variety of photo- 
synthetic redox cofactors and details of the surrounding polypeptides. 
Thermochromatium tepidum added a third species to the bacterial reaction 
center structures determined to high resolution (Nogi et al., 2000). More 
recently crystals of the RCs of PSI from cyanobacter Synechococcus elon- 
gatus yielded high-resolution structures (2.5 A) of the polypeptide and 
the chlorophyll, pheophytin, phylloquinone, and iron sulfur cofactors 
(Jordan et al., 2001). This was followed by 3.8 A structures of the RC of 
PSII of Syn. elongatus (Zouni et al., 2001) with enough secondary struc- 
tural features to allow spectroscopy and biochemistry to help resolve the 
individual chlorophylls, pheophytins, quinone, tyrosines, and Mn. Struc- 
tures of the intrinsic membrane LH protein partners to the RCs have 
appeared in the past few years: bacterial proteins LH] and LH2 from 
certain species have been modeled to the RCs from Rps. viridis and Rba. 
sphaeroides (Cogdell et al., 1999 and references therein). In the case of 
the PSI and PSII structures, extensive and quite different LH equipment 
is resolved in the many subunits surrounding the RCs. The structure of 
cytochrome bq, a respiratory companion to these photosynthetic pro- 
teins, has also provided us with the tunneling distances of the heme and 
iron sulfur cofactor constituents and one clearly located ubiquinone, 
probably a bound form of the substrate. 

The distances for the physiologically productive electron transfers of 
reaction centers with crystal structures (some illustrated in Fig. 3; see 
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color insert) can be divided into basic functional groups, as indicated by 
the different colors in the histogram of Fig. 4 (see color insert). There 
are obvious similarities and differences between the various reaction 
center structures that reflect the different physiologies of the reaction 
centers. The initial charge separation between chlorins takes place atthe 
shortest distances. In many cases the central pair of these chlorophylls 
is noticeably closer together than the other chlorins, between 3 and 4 A; 
however, the central pair in PSII has a spacing that falls in the average 
chlorin spacing of 4 to 6 A. Even at these slightly longer spacings, electron 
tunneling will still be more than rapid enough to ensure efficient charge 
separation. Instead, the wider central pair in PSII may help to assure the 
relatively high oxidation potential of the ground state, reflecting its role 
in water oxidation (Barber and Archer, 2001). 

The coupling into and out of the chlorin charge separation center 
takes place over longer distances of 6 to 11 A. The various redox chain 
separations that move charges out to the extremities of the protein for 
delivery to other redox protein partners in diffusive reactions occur over 
still longer distances of 8 to 13 A. Distances longer than 14 to 15 A are 
generally not found for physiologically productive reactions, because 
such reactions would not be significantly faster than the typical times for 
catalytic turnover of the enzyme. Longer electron-transfer distances are 
covered by cofactor chains, which so obviously extend above or below 
the chlorin centers of the photosystems. An interesting variation on the 
redox chain theme is provided by the FeS center of the cytochrome bq 
complex, which appears poised in evolutionary terms between a freely 
diffusive redox component and a fixed redox chain component. The 
double-headed arrow of Fig. 3 shows the FeS movement between Qo 
and heme cı implied by different crystal structures. There may be some 
advantages in robustness for maintaining a limited FeS diffusibility that 
leads to an escapement device that helps to ensure that the two-electron 
oxidation of Oo proceeds by two one-electron transfers in opposite direc- 
tions. These distances in this histogram have a comparable distribution 
to the set of productive electron transfer reactions between all the redox 
centers in multiredox-center electron transfer proteins deposited in the 
PDB database. 


IV. MANAGING LENGTH AND SIZE IN NATURAL LIGHT-HARVESTING DESIGN 


There are now sufficient high-resolution structures of antenna com- 
plexes available to make the detailed comparison of their functional 
design worthwhile (Cogdell et al., 1998). All light-harvesting systems by 
design must absorb light in the region of the spectrum that is actually 


LENGTH, TIME, AND ENERGY SCALES OF PHOTOSYSTEMS 79 


available in the ecological niche in which that organism lives, and they 
must deliver the excitation energy efficiently to its reaction center. The 
first of these imperatives principally directs the choice of pigment, while 
the second directs the overall architectural design of the antenna com- 
plex. Can we see any common features that would allow us to think of 
a basic prototypical antenna complex on which all the present-day vari- 
eties have been based? The only feature that easily stands out when these 
antenna structures are compared is that there are no common structural 
features. There are clearly many ways to design an antenna complex that 
is sufficiently energy efficient to allow that organism to be successful on 
an evolutionary time scale. 

Given that the structure of antenna complexes seems to be so variable, 
does this mean that the physics of the energy transfer process is rather 
nonstringent so that it will tolerate many design solutions to the problem 
of constructing an efficient light-harvesting complex? Ifa donor pigment 
is going to efficiently transfer excitation energy to an acceptor molecule 
then it must do it much faster then the excited state of the donor is 
lost by other competing processes. So what factors affect the rate of 
this energy transfer process? In general terms the rate of excitation en- 
ergy transfer depends on energy terms (spectral overlaps in the Förster 
theory), an orientation factor (the relative disposition of the transition 
dipole movements of the donor and acceptor in the Förster theory) and 
distance (the precise distance dependence varies with the type of en- 
ergy transfer) (Sundstrom et al., 1999). Of these terms distance is the 
strongest determinate of the overall rate of energy transfer. How far 
apart the donor and acceptor molecules in a light-harvesting complex 
can be, if the energy term and the orientation factor are held constant, 
will depend on the fluorescence lifetime of the donor’s excited singlet 
state. If, as is the case of carotenoids, this is very short, e.g., a few hun- 
dred femtoseconds, then the donor and acceptor molecules must be 
very close together. In all antenna complexes that contain chlorophylls 
and carotenoids, the carotenoids are always in van der Waals contact 
with at least one chlorophyll molecule. If the excited singlet state life- 
time of the donor molecule is rather long, e.g., a few nanoseconds, then 
the distance between donor and acceptor can be much longer. In the 
LH1/RC “core” complex from purple bacteria the distance between an 
antenna BChla molecule and the RC primary donor BChld has been 
estimated to be 483 A, but the rate of energy transfer of 35-50 ps is still 
fast enough to maintain very high quantum efficiency. 

In the classical Förster treatment of energy transfer between two well- 
separated donor/acceptor molecules the rate of energy transfer de- 
pends on 1/r°. Though there is a strong distance (r) dependence it still 
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allows a relatively large leeway in the special organization of chlorophyll 
molecules in an antenna complex, while still maintaining high overall 
efficiency of energy transfer. In the LH] complex from purple bacte- 
ria the distance between a B800 BChl molecule (the donor) and the 
nearest B850 BChla molecule (the acceptor) is ~18 A. The rate of this 
energy transfer renders as ~1 ps. Since the excited state lifetime of the 
donor is ~1 ns the overall efficiency of this energy transfer is very nearly 
100%. If the distance between the donor and acceptor was increased to 
30 A then the rate would decrease to ~23 ps, but the overall efficiency 
would still be ~99%. Similar arguments pertain to Chl-containing LH 
complexes of eukaryotes such as LHCII (Kuehlbrandt et al., 1994) or the 
inner antenna systems of PSI (Jordan et al., 2001; Zouni et al., 2001). 

It is clear that the basic physics of the energy transfer process allows 
for a large degree of tolerance in the design of a light-harvesting system 
before overall efficiency is compromised. The only cases where this is 
not true are where the singlet excited-state lifetime is so short, i.e., in 
the case of carotenoids, that donor and acceptor molecules must be in 
van der Waals contact to retain any efficiency at all. Interestingly, the es- 
sential function of the carotenoids is one of photoprotection (Frank and 
Cogdell, 1996). They rapidly quench the excited triplet states of chloro- 
phylls before they can react with molecular oxygen and produce the 
lethal singlet oxygen. The carotenoids prevent this by a triplet-triplet 
exchange reaction, and the resultant carotenoid triplet decays harm- 
lessly giving off the excess energy as heat. This triplet-triplet transfer 
reaction occurs by electron exchange, which requires van der Waals 
contact. This distance constraint, then also allows the carotenoids to act 
as accessory light-harvesting molecules (singlet-singlet energy transfer) 
because they have perforce been placed so close to the chlorophylls 
(Frank and Cogdell, 1996). 

It appears that all “core” antenna structures have a significant sep- 
aration between the closest antenna chlorophylls and the redox active 
chlorophylls. There is almost a “cordon sanitaire” around the heart of the 
reaction center. Why should this be? Oxidation of an antenna chloro- 
phyll produces a very effective quenching center that will “trap” and 
dissipate the energy long before it can reach the RC, where it is used 
productively (Law and Cogdell, 1998). It is therefore vital that the an- 
tenna chlorophylls closest to the reaction center chlorophylls be placed 
at a distance where electron transfer between the RC and the antenna 
system never occurs. Excitation energy transfer is still efficient over this 
distance and so the light-harvesting function is not compromised. Inter- 
estingly, though, this often means that paradoxically the slowest step is 
the energy transfer chains in the final “hop” into the reaction center. 
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This potential danger of electron transfer out into the antenna is there 
because the midpoint potential of an antenna chlorophyll molecule is 
not that much higher than that of the previous donor in the RC, and in 
PSII it is clearly well below that of P680. 

In every basic biochemical textbook the photosynthesis unit is de- 
scribed as a “funnel.” The energy transfer reactions are depicted as go- 
ing “downhill” from the periphery to the reaction center. However, the 
real picture is not always like this. As Fig. 5 (see color insert) shows, in 
most cases the final energy transfer step to the RC is “uphill,” and in the 
“core” complexes of PSI and PSII the antenna chlorophylls are largely 
isoenergetic. This then prompts the questions of how deep the funnel 
must be or how big the LH system can get in the absence of a funnel. An 
ever increasing body of work on artificial antenna mimics has shown that 
in the absence of a tightly coupled antenna system, as seen in the chloro- 
somes of green sulfur bacteria, above a certain critical antenna size the 
efficiency of energy transfer to the “trap” drops off dramatically. In effect 
in the absence of an energy gradient to “guide” the excitation energy 
to the trap it gets “lost” and never makes it to the trap. In most purple 
bacteria the size of the photosynthetic unit depends on the incident light 
intensity: the lower the light intensity the larger the size (Cogdell et al., 
1999). This is accomplished by making more LH2 relative to LH]. The 
energy of the BChl in LH2 is higher than that of those in LH] and so 
the funnel is established. Interestingly, some purple bacteria can control 
the steps of the funnel, too. Ifsome strains of Rps. acidophila are grown at 
very low light intensity this induces a new type of LH2 (called B800-820) 
where the large ring of closely interacting BChl molecules now absorb 
at 820 nm rather than 850 nm. This increases the slope of the “funnel” 
and prevents back transfer which allows these bacteria to grow at even 
lower light intensity than species which cannot accomplish this. Similar 
adjustments in the size of the LH system while maintaining the funnel 
in response to low light intensities are observed in other organisms, 
ranging from cyanobacteria (increasing phycobilin content) to higher 
plants (increased LHCII content). PSI, on the other hand, has about 
100 Chls per P700 and there appears to be no discernible energy funnel 
present. That being so, is there a maximum size for a “core” antenna 
in the absence of a “funnel”? It is tempting to propose that PSI is oper- 
ating at or near this limit. Indeed, this idea is reinforced in the case of 
iron-limited growth of cyanobacteria or in deep-living strains of the ma- 
rine oxyphotobacteria, Prochlorococcus, where any extra ring of antenna 
complexes supplement the antenna size of their PSI complexes. In the 
case of Prochlorococcus, these extra complexes have Chls a and b which ab- 
sorb to the blue of the “core” PSI Chla’ molecules, thereby establishing 
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the funnel and allowing a doubling of the size of the light-harvesting 
components while maintaining high efficiency. 

As mentioned above, the chlorosome antenna system of green sulfur 
bacteria breaches this size limit for an isoenergetic antenna. However, 
this is possibly due to its unique structure (Blankenship et al., 1995). 
The BChlc molecules are packaged into the chlorosome forming tightly 
coupled, strongly interacting helical arrays. Within these arrays the ex- 
cited state is extensively delocalized and, in effect, the array acts a super 
molecule. This enhances the rate of energy migration to such an ex- 
tent that a single chlorosome can operate as an efficient antenna system 
even when it contains up to 20,000 BChlc molecules. The green sulfur 
bacteria are strict anaerobes and therefore live in rather deep anaero- 
bic layers. The chlorosomes have such a high cross-structural area for 
photon capture that the cells appear black! In contrast, cyanobacteria 
and red algae use a range of phycobilin pigments in their LH systems. 
They are spectrally distinct and arranged within the phycobilisome so 
that light energy is absorbed at the peripheral ends of the rods by short 
wavelength pigments and transferred by the “funnel effect” to the longer 
wavelength absorbing allophycobilin adjacent to the membrane surface 
and close to the chlorophylls of the RC. 

The take-home message for understanding the management of inter- 
cofactor distances in natural antenna system design is that although the 
rate of excitation energy transfer is extremely sensitive to distance, the 
overall efficiency of energy transfer is, in general, not (see Fig. 2). This 
explains how it is possible to have such a variety of structures of antenna 
complexes and yet always retain efficient functioning. This freedom, 
then, has allowed Nature to have a very wide design brief in the con- 
struction of antenna complexes. 


V. MANAGING DISTANCE IN ELECTRON TRANSFER 


A. Predicting Electron Transfer Rates from Structure and Distance 


Like plant RC proteins, bacterial RC proteins each support 10 or more 
individual electron transfer reactions, and many have long been experi- 
mentally accessible for direct measurement of tunneling rate constants. 
The distances between the cofactors range from near van der Waals con- 
tact to as far away as 24 A and include tunneling steps both physiologically 
productive and counterproductive. The early application of molecular 
biological methods, as well as the natural stability and facile biochemi- 
cal cofactor replacements of the bacterial RC, made the purple bacterial 
reaction centers a test bed of models of biological electron tunneling, 
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opening the door to the examination ofthe role played by the electronic 
structure of the polypeptide tunneling medium between the cofactors. 
On the nuclear side of the equation, a variety of methods to alter and 
measure AG values in situ combined with flash-activated kinetic analy- 
sis has verified the Marcus-like (Marcus, 1956) parabolic log rate versus 
free energy relationship at ambient temperatures, as shown in Fig. 2B. 
In some key cases, the free energy versus rate relationships have been 
examined down to cryogenic temperatures (Gunner and Dutton, 1989; 
Gunner et al., 1986), verifying the original suggestion of Devault and 
Chance (1966) that at low temperature nuclear motion obeys a quantum 
tunneling rather than classical behavior originally described by Marcus. 
The parabolas shown in Fig. 2B are indeed quantized which has the ef- 
fect of flattening the curve and changing the prefactor on the nuclear 
term from 4.2 to 3.1 in the equation in the figure legend. 

For many of the bacterial RC reactions, the rate constant was measured 
or reasonably extrapolated to — A G? = X (Dutton and Moser, 1994). This 
provided critical kmax Or kopt values isolating the electronic contribution 
to the tunneling rate of each reaction (unmodulated and uncompli- 
cated by the natural variances of the Franck—Condon factor) and allows 
comparison of the tunneling medium between the cofactors of differ- 
ent electron transfers throughout the RCs. Figure 6 (see color insert) 
shows the now 10-year-old, remarkable length scale for tunneling in two 
bacterial RCs already presented in the general Fig. 2A focusing on the 
contribution of the protein structure to the electronic term. The ap- 
proximate exponential change of kopt over 12 orders of magnitude with 
distance from near van der Waals contact to 23 A reveals a straight- 
forward engineering blueprint for electron tunneling in the bacterial 
RC that is dominated by distance. This work provided the perspective 
needed to verify that the rate at van der Waals contact is near 10! el, 
about the rate-limiting time of one molecular vibration, and to suggest 
that tunneling reactions within the RC protein adhere reasonably well 
to this fundamental constant. 

The work also provided evidence that the initial charge separation 
from the light-activated BChls to BPh was two successive generic tunnel- 
ing reactions using the intervening BChl monomer as a real intermedi- 
ate, as opposed to a special single-step coherent electron transfer with 
the BChl monomer playing a special role as superexchanging virtual 
intermediate. In the end a two-hop mechanism has prevailed, and the 
single-step rate now appears to be far too slow (see van Brederode and 
van Grondelle, 1999). 

Figure 6 offers a seductive menu of motifs that in principle could have 
been selected to raise or lower the tunneling barrier and hence slow 
or speed the electron transfer rate at any one distance. However, the 
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structurally disparate reactions in the RCs have no distinction between 
physiologically productive and counterproductive electron transfers and 
closely adhere to a single exponential pointing to 10!° s71. This suggests 
that motifs that can be constructed in synthetic systems are not a con- 
spicuous feature in the natural proteins (Moser et al., 1992). The line 
fitted to the points in Fig. 6 provides an apparent “$ value” of 1.4 Aq}, 
where £ is the coefficient of exponential decay of the tunneling rate with 
distance. This £ can be interpreted simply as the weighted average of £ 
values of two compartments in the protein structural volume between 
the cofactors. Protein volume contained within the van der Waals radii of 
atoms and bonds is assigned a £ of 0.9 A~!, a value slightly less than that 
obtained for the aliphatic bridge, and the remaining interstitial volume 
is assigned a H of 2.8 AT. the lesser value for vacuum. The electronic 
term for optimal tunneling rate constant can be modified to introduce 
a packing density term, p, the fraction of the protein volume between 
the cofactors that is inside the van der Waals radii. This offers a simple 
way to calculate optimal rates directly from structures. 

The simple packing insensitive equation of Fig. 2 has a standard devia- 
tion with measured rate of about 10-fold (an order of magnitude). When 
the packing density is included in the equation of Fig. 6, the variance 
falls within the level of the collected experimental uncertainties in the 
structural resolution, and the kinetic, AG”, and A measurements. The 
packing rate expression lends itself to application to all electron transfer 
proteins. Calculated optimal tunneling rate constants from the p values 
and distances between the cofactors in other nonphotosynthetic proteins 
are also shown in Fig. 6 (pale points). This broader sample of proteins 
that perform a wide variety of oxidoreductase functions emphasizes the 
natural heterogeneity of the internal protein structure between reacting 
cofactors; again, there appears no statistical difference in p for physio- 
logically productive and unproductive reactions. Thus, it is clear that 
generally nature has not evolved specific structural motifs in the protein 
to enhance productive and suppress unproductive electron tunneling. 

The recurrent expectation over the past decade that aromatic amino 
acids commonly positioned in the protein between the donor and ac- 
ceptor will enhance electronic coupling and speed up tunneling was put 
to test in an unmatched series of measurements done with Rps. viridis 
(Dohse et al., 1995). This careful study on the structure shown in Fig. 7 
(see color insert) and results in Table I included direct measurements of 
AG? values, time-resolved kinetic analysis of the electron transfer rate, 
and crystal structure determination of the native Y162 and two mutants 
Y162F and Y162L. The findings support the conclusions that neither 
H-bonding nor aromatic z-orbitals of tyrosine contribute significantly to 
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TABLE I 
Rps. viridis Tyr Mutations” for Which Crystal Structures and Free Energies are Available’ 





Calculated Experimental 
Mutation Distance (A) AG (meV) Packing log rate log rate 
WT (Tyr) 12.29 —140 0.76 6.50 6.73 
Phe 12.28 —190 0.72 6.49 6.76 


Thr 11.88 —198 0.62 6.08 6.00 


“ From Dohse et al. (1995). 
b This allows a comparison of packing density sensitive rate with experiment, and the 
possible special effect of changing an aromatic for nonaromatic residue. 


the tunneling reaction beyond that provided by aliphatic replacements. 
Although crystal coordinates are not available for four other mutants 
that included methionine and glycine, they confirm this conclusion; in- 
deed methionine proves to be faster than the wild-type protein. These 
exercises present us with a rudimentary but clear view of protein as a 
highly robust electronic coupling medium for tunneling between redox 
cofactor donors and acceptors. 

Table I shows that two other parameters change significantly upon mu- 
tation of the tyrosine, the AG and the packing density, o These changes 
appear important in accounting for the ~6-fold rate variations that are 
evident in the family of Y162 mutants. By including these parameters in 
the four-parameter rate expression of Fig. 6, and assuming A value of 
around 0.7 eV, the calculated rate constants match the measured ones 
rather well (within a factor of 2). In separate studies the same authors 
specifically altered the AG by changing the reduction potential of the 
heme, first by mutating a charged residue near the heme but outside 
the tunneling volume, and second by introducing coulombic effects by 
adjusting the redox states of nearby hemes (Chen et al., 2000). Again, 
the effect on the rates followed that expected for 0.7 eV reorganization 
energy. 


B. Robust Natural Design of Charge Separation through Redox Chains 


1. Heme Chain Archetype in Rps. viridis 


Chains of redox cofactors for long range electron transfer are clearly 
the way electrons are transferred over the tens of angstroms dimensions 
of membranes and their proteins. Once again, purple photosynthetic 
bacterial reaction centers provide an archetype for understanding elec- 
tron transfer chain design and behavior. The heme chain in Rps. viridis 
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(Fig. 8; see color insert) extends from the BChls out into the periplasm 
some 60-70 A to a docking site for soluble cytochrome cg at the ter- 
minus. The chain is characterized by close positioning of the hemes, 
typically 5 A, and intrinsically poised to promote very rapid tunneling. 
The reduction potentials describe a dramatic sequential combination 
of endergonic and exergonic tunneling steps that is overall from the 
cytochrome ca to the BChly modestly exergonic (Alegria and Dutton, 
1991; Shopes et al., 1987). 

The first measurement of electron transfer in the chain, from heme 3 
to heme 1 in 1.8 us (Shopes et al., 1987), suggested that tunneling could 
proceed via thermally assisted uphill tunneling to the intervening heme 
2 and then exergonically to heme 1, if the intrinsic electron transfer 
rates between the hemes were in the picosecond time scale, as suggested 
by our calculations. Using a A of 0.7 eV, the equilibrium reduction po- 
tentials adjusted for charge interactions, and edge-to-edge distances, R, 
the calculated transit time is ~2 us. The figure shows several other mea- 
surements of the remarkably rapid transit through the five hemes to 
the BChls, which again track the calculated rates well. Thus it appears 
that it is the close proximity of the hemes in the structure that sets the 
stage for simple sequential electron hopping from heme to heme in- 
cluding substantially unfavorable thermally accessed states. It appears 
that equilibrium reduction potentials of the hemes (only adjusted for 
heme-heme coulombic interactions) are operative on the time scale of 
electron transfer through the heme chain. Additional support for this 
simple model comes from the activation energy obtained from the tem- 
perature dependency of tunneling from heme 3 via heme 2 to heme 1 
which matches the free energy obtained from the reduction potentials. 
This also argues against a model in which the intervening heme 2 acts 
not as a real redox intermediate in a two-step electron transfer, but as a 
quantum mechanical virtual superexchange intermediate in a single co- 
herent step tunneling from heme 3 to heme 1. Like the tyrosine between 
the heme and BChls, the heme between heme 3 and heme 1 occupies a 
major part of the intervening space but does not act a dramatic tunneling 
barrier lowering element. 


2. Primary Chlorin Chains 


The redox chlorins at the core of the various reaction centers form an 
obvious chain with separations of less than 6 A that ensure rates of 10 ps 
or less. The effect of this chain is to apparently make the photoinduced 
oxidant and reductant capable of residing briefly on any of the chlorins, 
subject principally to the free energy of that state and the energetic 
penalty of any uphill reverse electron transfer. When the free energy 
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differences between the various chlorins are small and comparable to 
the thermal energy, as appears to be the case in the RC of PSI, the light- 
induced reductant and oxidant may be appropriately described as being 
shared by all the chlorins. What guides the eventual fate of the oxidant 
and reductant is simply the availability of the closest nonchlorin centers, 
which continue charge separation in relatively longer, slower, and larger 
driving force electron transfers. 

The symmetry of the chlorins in the RC means that there are two 
branches of the chlorin chain extending across the membrane. In 
the case of PSI and even more so in the RC of green sulfur bacteria 
(Golbeck, 1994), there may also be a near functional symmetry of elec- 
tron transfer across these chains to the symmetrically placed quinone ac- 
ceptors and then to the centrally located iron-sulfur center, F, (Barber 
and Archer, 2001; Guergova-Kuras et al., 2001; Jordan et al., 2001). How- 
ever, in the case of RCs of purple bacteria and PSII the situation is quite 
different. Only one of the quinones (Qa) appears to be designed as the 
functional acceptor. How is the high quantum efficiency of charge sep- 
aration maintained by guiding electron transfer down the appropriate 
branch of these RCs? In the case of the purple bacterial RC, if the en- 
ergy gap between the bacteriopheophytins and the bacteriochlorophylls 
on both branches were about 0.16 eV, then some electrons could be 
trapped on the inactive bacteriopheophytin leading to energy-wasting 
charge recombination to the bacteriochlorophyll dimer ground state. 
Indeed, under certain experimental conditions (e.g., excess light) the 
wrong side BPh can be driven reduced (Florin and Tiede, 1987). The 
active side BPh may be favored kinetically, rather than thermodynami- 
cally, by a lower wrong side BChl monomer redox potential that makes 
electron transfer from the dimer to this side sufficiently uphill; a similar 
effect can be accomplished by a larger reorganization energy for this 
electron transfer (Moser et al., 2001; Parson et al., 1990). 

Although at first sight the engineering of PSII and that of the purple 
bacterial RCs are similar, there are distinct differences. In the case of 
PSII, it appears that the relatively small energy gap of about 0.1 eV be- 
tween the pheophytins and the chlorophylls allows any electron on the 
“wrong side” pheophytin to thermally equilibrate with other chlorins 
on a rapid enough time scale. This similar energetics of the chlorins in 
PSII means that charge separation need not focus on the central pair 
of chlorophylls (designated Pı and De in Fig. 3 or Pp; and Pps in Zouni 
etal., 2001), as is the case in the purple bacterial reaction center. Indeed, 
it appears that the accessory chlorophyll (designated Chl, in Fig. 3 or 
Chlp; in Zouni et al., 2001) adjacent to the active pheophytin acts as 
the most favored primary donor (Barber and Archer, 2001; Dekker and 
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Van Grondelle, 2000; Diner et al., 2001; Prokhorenko and Holzwarth, 
2000). 

The fate of the resulting “hole” will be governed by the redox differ- 
ences between the four chlorophylls, which all have high redox poten- 
tials in the region of 1V. Since the tyrosine donors Tyrz and Tyrp are 
oxidized at about the same rate (Faller et al., 2001) and are more or less 
equidistant between the Pı and Pg chlorophylls respectively (see Fig. 3), 
then on the first photochemical turnover the hole must be delocalized 
more or less equally between the two chlorophylls Pa and De even at 
low temperatures. This symmetry in electron donation is broken by the 
fact that the Mn cluster which catalyzes the water oxidation process, is 
located only on the D1 side. The consequences of this is that the Pa 
chlorophyll becomes the active P680°* species, presumably because the 
long-lived inactive Tyrp radical cation generated during the first or sec- 
ond photochemical turnover increases the redox potential of Pg relative 
to Pa. Direct evidence that P, does indeed generate the P680°* species 
has come from the recent mutational studies (Diner et al., 2001). The 
electron transfer from Tyrp to Pat may, in part be coupled to proton re- 
lease from the tyrosine which could then facilitate an electron/proton 
abstraction from the substrate water molecules bound to the Mn cluster 
(Tommos and Babcock, 2000). 


3. Charge Recombination in Chains 


One consequence of the common use of redox chains for forward, 
productive electron transfer is the availability of the same redox chains 
for reverse, unproductive charge recombination. Indeed, when these 
recombination reactions are observed in reaction centers, they provide 
some of the best examples of the manner in which redox chains can 
mediate uphill electron transfer en route to a final cofactor that confers 
an overall favorable free energy. Figure 9 presents electron tunneling 
in the physiologically unproductive direction through the chain from 
Qa to BChly in Rba. sphaeroides and the effect of altering the reduc- 
tion potential of Qa on the rate of tunneling and the route taken from 
Qa to BChly+ (Woodbury et al., 1986). Charge recombination from 
the native ubiquinone-10 is dominated by the slow ~10 s~!, tempera- 
ture independent, direct tunneling over 23 A from Q4” to BChls*. The 
native ubiquinone-10 was replaced with a variety of synthetic analogues 
of widely differing but mainly lower reduction potentials so as to steadily 
diminish the free energy gap between Qa and BPh. The lower reduction 
potential also served to increase the driving force of direct Q 47 to BChlyt 
recombination, which causes minor acceleration or slowing depending 
on whether —AG enters the inverted Marcus region (-AG > A) or 
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Fic.9. Substitution of exotic quinones for native ubiquinone at the Q4 site or reaction 
centers from Rba. sphaeroides shows that charge recombination from the BChly+Q, 7 state 
proceeds uphill via a thermally activated intermediate “X” near the energy level of BPh 
(Woodbury et al., 1986). Figure after Gunner (1988). 
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not. However, beyond a certain threshold the route via BPh becomes 
dramatically dominant. The slope is about a 10-fold change in rate per 
0.06 eV change in —AG at room temperature, providing all the signs 
of a thermally activated electron tunneling between Q,a— and BPh, that 
transiently forms BD" en route to BChls*. Further support is provided 
by the direct correspondence of the activation energy of the recombina- 
tion reaction and the free energy gap between Q; and BPh calculated 
from the redox midpoint potentials for a series of quinones. 

Charge recombination does not use the intervening large highly con- 
jugated BPh molecule as a superexchanging element in a single tunnel- 
ing step, but rather establishes a quasi-equilibrium with the BPh some 
0.48 eV uphill. Moreover, at low temperatures when the thermal route is 
essentially eliminated and only single-step direct recombination is possi- 
ble (Gunner etal., 1986), there is no evidence that BPh employs superex- 
change to diminish the effective tunneling barrier and enhance the rate. 

Ofall the reaction center and photosystem species discussed here, only 
Rba. sphaeroides has an energy gap between the BPh and native Qa (or 
their analogues) large enough to eliminate thermally activated charge 
recombination via the (B)Ph or analogue at room temperature. Yet the 
submillisecond recombinations in these other species are still sufficiently 
slower than the forward physiological tunneling to enable stable charge 
separation. 

Figure 10 (see color insert) shows the rate of unproductive charge re- 
combination from flash generated Q4 back to the first and third hemes 
of Rps. viridis through a series of up and down steps covering 100 A. 
Again there is solid matching of measurement (Gao et al., 1990) and 
calculations, even using the unmodified equilibrium redox levels or po- 
tentials of the intervening BPh, BChls and first second and third hemes. 
Recombination rates of seconds ensure that productive charge separa- 
tion will prevail in vivo. Chains of relatively closely positioned cofactors 
appear to be the normal modus operandi for very rapid, highly directed 
electron tunneling over considerable distances. In dramatic contrast to 
the significant exponential penalty of tunneling long distances through 
protein media, the tunneling time of transit through the simplest chains 
in which each step is similar and mildly exergonic is roughly linearly 
dependent on its length. 


C. Simulating Photosystem Operation with Tunneling Theory 


Enough is known now to begin to estimate all the tunneling rates be- 
tween all redox centers in the photosystems for both charge separations 
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TABLE II 
Estimated Redox Midpoint Potentials in Photosystems 
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Cofactor Em Cofactor En“ Cofactor DR 

Rps. viridis RC PSI PSII 

BChl2/Pgrounad 0.50 (Gao et al., P700 0.43 P680grounad 1.15 (Klimov 
1990) et al., 1979) 

Pr —0.79 P700* —1.17 P680anion —0.70 

BChl, —0.71 Chla  -1.14 Chl,* —0.63 

BChlg —0.85 Chlıg —1.14 Chlg —0.70 

BPha —0.63 (Woodbury Chla  —1.1 Pha —0.59 (Klimov 
and Parson, et al., 1979) 
1984) 

BPhg —0.63 Chla  —1.1 Phg —0.59 

Qa —0.15 (Prince et al., Qa —0.74 Qa —0.08 (Krieger 
1976) et al., 1995) 

Or 0.04 (Wraight, Or —0.74 Tyr 1.00 
1979) 

Heme 1 0.38 (Alegria and Fx —0.67 Mn 0.90 
Dutton, 1991) 

Heme 2 0.02 (Alegria and Fa —0.54 
Dutton, 1991) 

Heme 3 0.32 (Alegria and Fp —0.56 
Dutton, 1991) 

Heme 4 —0.06 (Alegria and 


Dutton, 1991) 


“ From Iwaki and Itoh (1994) and Vassiliev et al. (2001). 
» From Rutherford and Krieger-Liszkay (2001). 


and charge recombinations and to simulate the electron transfer reac- 
tions that take place after a flash of light. We use the tunneling expres- 
sion in Fig. 6 if the structural resolution permits estimates of the packing 
density, p, or the simpler tunneling expression of Fig. 2 if structural reso- 
lution does not permit p estimates. Examples follow in Fig. 11 (see color 
insert and Table II). Even without adjusting the usually unmeasured 
reorganization energy, these simulations follow kinetic measurements 
within an order of magnitude of time. 

Tunneling distances between chlorins are so close that it is not an es- 
sential part of photosystem design that any particular chlorin or chlorin 
pair be the source of the initial excitation and charge separation. Details 
of the relative redox levels of each of the excited and ground states will 
determine which particular chlorin or chlorin pair is first oxidized and 
which first reduced. Similar concerns will determine which side of the 
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chlorin chains will be reduced. In the Rps. viridis simulation, the BChls 
has intentionally been given a midpoint potential slightly more negative 
than P* so that electron transfer favors the L side. For PSI a simpler case 
is shown in which both sides have symmetric midpoint potentials; this 
means both chains participate nearly equally in electron transfer, with 
no loss in charge separation efficiency. In PSII, asymmetry in the tun- 
neling distances alone is sufficient to favor electron transfer along the 
chain that eventually leads to Qa although some temporary reduction 
of the chlorins on the other side might reasonably be expected. How- 
ever, the PSII simulation illustrated introduces a slight asymmetry in the 
midpoint potentials so that A side B* is favored as the principal charge 
separating center, in recognition of the recent observations (Diner et al., 
2001). 

Each of the photosystems ejects an electron from the excited chlo- 
rin complex to a quinone within a nanosecond, followed by electron 
transfer along chains leading out of the charge separation center within 
100 ns. The high potential reaction of Tyr and Mn in PSII is quite rapid, 
beginning in the simulations on the same time scale as the quinone re- 
duction reaction. However, it has been suggested that tyrosine oxidation 
may not be rate limited by tunneling, but by H* transfer (Diner et al., 
2001). 

Charge recombination times will depend on the initial state of the pho- 
tosystem being simulated. Under physiological conditions we may expect 
that diffusible species will carry oxidizing and reducing equivalents away 
so that such recombination is unlikely. However, under certain experi- 
mental conditions, these recombination reactions can be observed and 
can provide information about the engineering tolerances of the elec- 
tron transfer system. In the viridis system, the simulations show a recom- 
bination through uphill intermediate redox states in tens of seconds. A 
similar time scale would be expected for the recombination from Q4 to 
the Mn cluster, which is here artificially forbidden to oxidize water. For 
PSI, on the other hand, without any donor to Pt, the simulations antic- 
ipate a recombination from the photoreduced FeS chain on the time 
scale of tens of milliseconds. This is beginning to become an engineer- 
ing concern and shows the importance of donors to P* in stabilizing the 
charge separation reactions. While close positioning of redox centers in 
the photosystems allows rapid enough forward electron tunneling for 
efficient charge separation, the same close positioning offers the possi- 
bility of thermally assisted reverse electron transfer, with accompanying 
loss of efficiency. Donation to Pt removes the time scale of these recom- 
bination reactions to the many seconds, where diffusive reactions can 
ensure overall efficiency. 
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VI. MANAGING PROTON REACTIONS IN PHOTOSYNTHESIS 


A. Design Issues for Proton Transfer 


Whether the process is adiabatic or nonadiabatic (tunneling), the 
range of proton transfer is restricted to distances no more than 1 Ä 
and mechanisms rely exclusively on reactive complex formation. Thus, 
in contrast to electron transfer, the short range, bond-length nature of 
proton transfer necessitates considerations of structure. 

The charged nature of electron and ion transport functions gives pro- 
ton transport two distinct roles in bioenergetics—charge compensation 
and long-distance “chemical” movements. In both cases, proton trans- 
fer involves the “normal” acids and bases of Eigen (1964)—oxygen and 
nitrogen functionalities that can exchange protons at high rates given 
moderately favorable driving force and no hindrance to reactive com- 
plex formation. In solution, the initial encounter complex quickly finds 
the reactive configuration, and the elementary event of proton transfer 
is much faster than dissociation. Thus the reaction is diffusion limited: 


AH+B=AH-:-B=A---HB=A+HB 


If attainment of the reactive complex is inefficient, so as to be kinetically 
significant, more elaborate schemes must be considered. Such rate limi- 
tation is relatively uncommon in solution reactions of normal acids and 
bases, but strong internal hydrogen bonding is one significant source of 
constraint, and the effect can be dramatic (Hibbert, 1986). To the extent 
that buried polar groups in proteins are largely maintained by internal 
hydrogen bonding, we might expect this to be an important factor in 
the design of rapid proton delivery mechanisms in proteins and in the 
gating of proton conduction paths. 

Principles for transmembrane proton transfer have often been sought 
in the properties of “hydrogen-bonded chains” (HBCs). These can be 
formed from protonated functional groups with good proton donor and 
acceptor activity (primarily oxygen-containing species: water, hydroxyl, 
carboxylic acids) and are thought to allow proton transfer by H-bond 
swapping in a modified Grothuss mechanism. Following Onsager, Nagle 
has described two distinct processes necessary for net proton transfer 
through an HBC—hopping of the proton, and turning of the functional 
groups to reorient themselves back to the original configuration (Nagle, 
1987; Nagle and Morowitz, 1978). This has now been well characterized 
by computational studies of the pore-forming antibiotic gramicidin A 
(gA), which contains a single file of H-bonded water molecules and 
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exhibits a very high proton conductance (Chiu et al., 1999; Jordan, 1990; 
Pomes and Roux, 2002). 

Molecular dynamics and Brownian dynamics simulations of gA show 
that proton conduction along the water chain can be an extremely fast 
one-time event. However, it leaves the hydrogen bond network in the 
opposite polarity from the starting configuration. Thus, all functional 
groups (waters) must reorient in order to conduct a second proton. In 
an effective reduction of dimensionality, the water molecules in the gA 
pore are restricted to three hydrogen bonds—two to adjacent waters and 
one to a carbonyl oxygen in the channel wall. This allows the Grothuss 
mechanism to proceed with no breakage of hydrogen bonds, and proton 
hopping in gA has very low activation energy. Transfer of a single proton 
along the whole length of the pore can occur in just a few picoseconds 
(Pomes and Roux, 2002), with pairwise jump rates faster than 10!? s7! 
(Szczeniak and Scheiner, 1985). Unlike hopping, the turn process in- 
volves breaking and making hydrogen bonds for all groups, probably 
sequentially but in a substantially coordinated way. This normally makes 
the reorientation part of the cycle the rate limiting process. 

Some useful principles of fast proton conduction are evident from 
studies of gA (Pomes and Roux, 2002). Effective proton conduction re- 
lies on pathways or networks with sufficiently strong H-bonds that the 
barrier in the proton transfer coordinate is not high. Thermal activity 
can then lower this barrier height sufficiently to allow barrier crossing or 
facile tunneling on the time scale of solvent fluctuations (~ 10-1? s). At 
the same time, the H-bonds must not be too strong, so that thermal fluc- 
tuations can also cause hydrogen bond breakage to allow reorientation 
of the chain. 

Although the secrets of maximal rates of proton conduction are well 
illustrated in gA, multifunctional proteins that couple H* conduction to 
other events do not exhibit well-formed, proton-conducting hydrogen 
bond networks. Indeed, in the bacterial reaction center the putative 
active path is poorly connected by hydrogen bonds detectable in the best 
current X-ray structures (2.2 A resolution; Stowell et al., 1997). Paddock 
et al. (1999) have shown that chemical blockage or a simple mutational 
lesion of this active path diminishes proton transfer rates by at least 1000- 
fold. Thus, the several well-connected (but not quite continuous) files of 
water that are seen in the X-ray structures, reaching toward the Qg site 
from the cytoplasmic side, do not conduct protons at significant rates. 

The implication is that the proton delivery paths are transient and 
highly dynamic entities. This is explicitly demonstrated in the elegant 
X-ray studies of bacteriorhodopsin (bR), where the path for reprotona- 
tion of Asp-96 is not in existence in the early part of the photocycle, 
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but forms only after the isomerization-driven, internal translocation has 
taken place (Sass et al., 2000). 


B. Proton Transfer and Coupled PT/ET 


In biological electron transfer, the role ofH* depends on the strength 
of coupling between the electron and proton: 


1. Long-range electrostatics—weakly coupled pK shifts, dielectric re- 
sponses (relaxation); AG < 20 meV 

2. Short-range (or structurally relayed) electrostatics—local charge 
compensation, chemically specific H-bonding; AG < 150 meV 

3. Strong coupling—direct protonation; H atom transfer; chemical 
specificity, AG ~ 100-1000 meV. 


The extremes of these coupling strengths define two distinct functional 
roles for protons: 


1. Dielectric relaxation and distributed charge compensation, based 
on weak coupling 

2. Bond making/breaking in coupled ET/PT to or from a hydrogen 
carrier, embodying strong coupling 


C. Protein Relaxations 


The light-driven charge separation in the reaction center presents a 
dramatic impulse to the dark-adapted state of the protein. The de novo 
creation of charge sets in train responses that proceed over many orders 
of magnitude of time: the fastest of these relaxations (about 0.1 ns) may 
be crucial in determining the high yield of the primary events, leading to 
formation of [PQ ]* in about 0.1 ns, whereas relaxation processes at 
longer times contribute to the thermodynamic and kinetic stabilization 
of the charge separation and its subsequent utilization (Holzwarth and 
Muller, 1996; McMahon et al., 1998; Woodbury et al., 1994). 

The full extent of relaxation in the PQA” state spans about 120 meV 
over the time range from 100 ps to about 1 ms at room temperature 
(McMahon etal., 1998). Of this, as much as 80% is achieved prior to 1 us. 
Thus, relaxations associated with diffusion-controlled net H* uptake do 
not contribute more than 20-30 meV. However, proton rearrangements 
(intraprotein H* transfer) can certainly be a significant part of the over- 
all response of the protein at much shorter times. 

The major fraction of relaxation in the PFQ,” state is frozen out 
at a temperature of about 100 K, and it is noteworthy that very similar 
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behavior isseen upon dehydration atroom temperature (Clayton, 1978). 
In addition to the known participation of solvent in protein relaxations 
(Beece et al., 1980) a role for internal water should be expected in the 
protein dielectric response. 

Relaxation phenomena are equally evident at the donor side of the 
reaction center. The well studied, fast (<1 us) electron transfer reactions 
from bound c-type cytochromes of RCs from such species as Chromatium 
and Rps. viridis frequently show a progressive shutoff of electron transfer 
from the high potential heme, which is closest to P (Gao et al., 1990). Itis 
suggested that this is due to a large (>100 mV) increase in the midpoint 
potential of the heme, associated with the freezing out of solvent or 
hydration-related relaxation processes (Kaminskaya et al., 1990). This is 
consistent with a deuterium solvent isotope effect for this reaction, as 
reported by Kihara and McCray (1973). 


D. Protonation Archetype: Q4/Qp in Purple Bacteria 


In the bacterial RC, export of reducing equivalents is achieved by dou- 
ble reduction of the secondary quinone, Op, and release of the hydro- 
quinone into the membrane. Two light-activated turnovers of the RC are 
required, with the reduction of P+ by a secondary donor (Okamura etal., 
2000; Shinkarev and Wraight, 1993); the change in protons associated 
with each step is given by the indices m, n, r, and s: 


hv = os nE 
QuQzp—> Qa Up = Oh (1) 
"IT (Ht), (n—m)H* (H+), 
hv 


OOO. = aH Do (2) 


(H+), (rar (H+, (l-r+s)H* (Hr), (1-s)H+ 


The two one-electron transfer events exhibit the two basic roles for pro- 
ton uptake and redistribution—weakly coupled charge stabilization for 
the first electron (m ~ n~ r ~ s < 1), and strongly coupled bond for- 
mation for the second (net uptake of 2H* per QHs). However, it is now 
evident that the two are functionally related, and part of the charge- 
compensating Ht uptake, e.g., on the first electron transfer, is destined 
for delivery to the quinone head group in the second reduction step. 


1. Proton Coupling to the First Electron Transfer 


A description of proton delivery to Qg has been outlined from a com- 
bination of experiment and structure-based calculation of energetics. In 
Rba. sphaeroides, site-specific mutagenesis showed substantial roles for a 
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small cluster of acidic residues, Asp!?!°, Glu'?!?, Asp! "3, and Glu"!73, but 
it was not possible to interpret the results uniquely in terms of energet- 
ics (electrostatics) or direct functional involvement (as proton carriers) 
(Paddock et al., 1989, 1994; Takahashi and Wraight, 1992, 1996). By com- 
bining conformational sampling with electrostatic energy calculations, 
Alexov and Gunner (1999) showed that a small number of residues are 
closely coupled to function in both the energetics and mechanics of 
proton delivery to Op. 

At the purely energetic level, the Qa°-Q3 =QaQpg_ electron transfer 
equilibrium is driven by the differential solvation of Op "` versus Qa’, 
which is largely effected by internal proton shifts within the acid cluster. 
Interactions within the cluster poise the apparent pK values such that, 
in the ground state, Asp!?!° is ionized and Asp ZÜ is protonated, even 
though the latter is more solvent exposed. Arrival of a negative charge on 
reduction of Qg—and, to a significant extent, also Q ,—tips the balance 
in favor of protonation of Asp!?" and ionization of Asp ZIL which results 
in a switch of H-bond connections. In the ground state, Ser?! is a 
H-bond donor to Asp(—)!#!8, Following formation of the semiquinone 
anion, Ser??? becomes a proton donor to Qg and AspH!?!8 now donates 
a H-bond to Serl?3 (Fig. 12; see color insert). Although no proton is 
taken by the semiquinone at this stage, this sets up a connected pathway 
that could be used for proton delivery in the second reduction step. 

More relevant to the first electron process, the proton equilibration 
capabilities of the acid cluster are largely independent of proton uptake 
from the medium. Only about 0.3-0.4 Ht are taken up at neutral pH 
and, remarkably, this is independent of whether the electron is on Qa 
or Oe, i.e., m = n~ 0.3 (differences appear outside the pH 6-8 range) 
(Maroti and Wraight, 1988; McPherson etal., 1988; Okamura et al., 2000). 
This reflects a general lack of ionizable groups near Q4, leading to a low 
dielectric response of the immediate environment and consequent long- 
range influence of the Q,” charge on ionizable groups close to the Qg 
site. The calculated proton distribution is similar but not identical for 
both semiquinones, so proton redistribution contributes to the protein 
relaxation that controls the energetics of the electron transfer. Alexov 
and Gunner (1999) have described the reduction of Q4 as “preparing” 
the Qg site for the subsequent electron transfer, by partially redistribut- 
ing the protonation states of the acid cluster. 

X-Ray structural studies indicate that the ground state of the bacte- 
rial RC contains Qg bound in a distal position that is presumed to be 
inactive with respect to electron transfer ((Ermler et al., 1994; Graige 
et al., 1998; Lancaster and Michel, 1997; Stowell et al., 1997). The inac- 
tivity would arise from a 2 A greater distance from Qa, and a distinctly 
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less polar and less H-bonded environment, which is likely to lower the 
redox potential, making Oe unreducible by Q47. However, free energy 
calculations and molecular dynamics simulations suggest that the equi- 
librium is finely balanced and the proximal position for Qg is favored 
when Qa” is present and the acid cluster is suitably protonated (Alexov 
and Gunner, 1999; Walden and Wheeler, 2001; Zachariae and Lancaster, 
2001). Thus, reduction of Q 4 appears to prepare the On site by bringing 
the quinone into the proximal position, as well as by redistributing the 
protonation states of the protein. 

Depending on the degree of preparation of the Oe site by Qa, the 
first electron transfer may be downhill or uphill. In the latter case, the 
electron transfer equilibrium is pulled over by subsequent relaxations, 
including further internal proton redistributions as well as net H* up- 
take. With artificial quinones as Qa, however, Gunner and co-workers 
have reported a very rapid phase of electron transfer to Qg when Q4 is 
sufficiently low potential, and have shown this component to speed up 
with increasing driving force, indicating rate limitation (Li et al., 1998, 
2000). This result is controversial (Graige et al., 1998), but a reasonable 
interpretation is that the initial electron transfer equilibrium, on the 
l-us time scale, is unfavorable and can only be seen when the equilib- 
rium is artificially enhanced by using low potential quinone analogues 
as Qa. The slower kinetics, in all cases, would then represent the equi- 
librium being pulled over by subsequent Ht uptake and redistribution. 
For the native ubiquinone, the initial equilibrium could be unfavorable 
by as much as 100 meV (Wraight, 1998). 


2. Proton Coupling to the Second Electron Transfer 


Although the acid cluster plays a distinct and identifiable role in the 
relaxation response of the protein on the first electron transfer, electro- 
static calculations indicate that the full energetic contribution is quite 
widely distributed, with small ionization state changes of a number of 
residues. On the other hand, proton delivery to the quinone head group 
during the second reduction step is a targeted function, with discrete 
termini at the quinol oxygens. 

The pathway for proton uptake associated with the first and second 
electron seems to be the same (Adelroth et al., 2000). Both are inhib- 
ited by binding of certain divalent transition metal ions, notably Zn?*, 
NU", Cd?+, and Cu?*, but not Fei, or Ca?* or Mai" (Paddock et al., 
1999; Utschig et al., 1998) and by the double mutation of Asp?! and 
Asp!” (Paddock etal., 2000). The binding site for the inhibitory ions has 
been identified in X-ray structures as involving one of a surface cluster 
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of histidines, either or both of Asp!?!° and Asp™!”, and solvent waters 
(Axelrod et al., 2000). The relative insignificance of net Ht uptake (in 
contrast to internal redistribution) for the energetics of the first elec- 
tron transfer is supported by the lack of effect of metal binding on the 
electron transfer equilibrium. 

For the second electron transfer, the pH and free energy dependence 
of the reaction rate clearly support a mechanism in which protonation of 
Os precedes the electron transfer, which is rate limiting (Graige et al., 
1996) (Fig. 12). It is noteworthy that in this reaction mechanism, an 
energetically unfavorable protonation intermediate (QpH) is accessed 
by thermal energies to allow electron transfer to proceed with a large 
free energy drop. Nevertheless, the electron transfer is still rate limiting, 
largely because of the substantial distance—at 14.5 A it is near the limit 
of the physiological range described by the histogram of Fig. 4. 

It seems reasonable to think that the H-bonded connection between 
On `. Ser!*?3, and AspH!#!°, established after the first electron trans- 
fer, provides the terminal path for delivery of the first proton to the 
quinone, ahead of the second electron. From the X-ray structures, a wa- 
ter molecule is positioned between Asp!?!3 and AspM!” and a complete 
pathway could comprise 


Q, —Ser!??8—AspH!?!3_H,O—Asp™!”— (aq) 


With this arrangement, delivery of Ht from AspH!?!? to form QgH 
could be in extremely rapid equilibrium—albeit unfavorable because 
of the low pK of On compared to Graige et al. (1999)—but equilibra- 
tion with the bulk phase would require rotation of Asp!?13, H>O, and, 
probably, Asp™!”. None of these presents obvious obstacles and yet mu- 
tation of AspM!” (or Asp!?!0) alone has little or no effect on the second 
electron transfer and its associated proton deliveries. Only when both 
Asp™!” and Asp!*!° are substituted (by asparagine) is the proton path 
inactivated. 

It is not clear how this apparent cooperativity between Asp“ ’ and 
Asp!?!0 works in the proton pathway, nor whether the severe inhibi- 
tion due to double mutation, or binding of metal ions, arises from an 
energetic or a kinetic source. It may be that the proton path is actually lo- 
cated between Asp!?!° and the metal binding site, involving both Asp™!” 
and Asp ZIL. In the X-ray structures, this region contains significant void 
volume and appears to be accessible to solvent. This suggests the possibil- 
ity that a conducting path is dynamically assembled from solvent water, 
possibly requiring the good hydrogen bonding attributes of the acidic 
groups. This would put it in a similar category as bacteriorhodopsin, and 
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as proposed for cytochrome oxidase, even though the RC has no evident 
need to gate its proton transfers. 

In fact, transient assembly of H-bonded water files is probably com- 
mon in enzyme function. In carbonic anhydrase, for example, the rate- 
limiting step is proton transfer from the active-site Zu?" Oz complex 
to the surface, via a transient, H-bonded water network that conducts 
H". Analysis of the relationship between rates and free energies (p K dif- 
ferences) by standard Marcus theory shows that the major contribution 
to the observed activation barrier is in the work term for assembling the 
water chain (Ren et al., 1995). 

Although the well-formed H-bonded file of water in gramicidin may 
be atypical in the wider realm of proton conduction, the principles of 
proton conduction gained from gA can probably be generalized to the 
transient pathways seen in other systems. Even transient networks or files 
of water can be expected to be quite strongly H-bonded; their transient 
nature is likely to be more of entropic than enthalpic origin. The sym- 
metry of the acceptor-donor pairs of waters, or otherwise similar func- 
tional groups (with similar pK values), will also contribute to keeping 
a low barrier in the proton transfer coordinate. At the same time, ther- 
mal fluctuations are clearly sufficient to disrupt the hydrogen bonds, 
leading to disassembly of the chain. Studies on gramicidin also illus- 
trate that the presence of an ion, for example a sodium in the channel 
mouth, can stabilize the file of waters (Jordan, 1990). This suggests the 
possibility that the proton path in bacterial RCs, or in the channels of 
cytochrome oxidase, could assemble in response to specific charge states 
of the active site—Qp , Qa Un , QpH , etc., in RCs, and various inter- 
mediates of the redox cycle in oxidase. Of course, in bacteriorhodopsin 
the assembly and function of the reprotonation pathway is already being 
described in structural detail (Lanyi and Luecke, 2001); the energetic 
drive is provided by the charge and dipole movements associated with the 
photoisomerization and the initial charge transfer from the Schiff base. 

The interesting issue of proton collection at the protein surface is not 
addressed here, but it does appear that proton antennae are designed 
surface features of several proton translocating proteins, including the 
bacterial RC (Adelroth et al., 2001), cytochrome oxidase (Marantz et al., 
1998), and bR (Checover et al., 1997). 





E. Protonation in the Oxidation Reactions of Photosystem IT 


The oxidation of water by Photosystem II in oxygenic photosynthesis 
is an energetically and mechanistically challenging process, in which 
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proton transfers play a critical role. The oxidizing potential generated 
by the reaction center chlorophylls (P680) is in excess of 1.1 V and the 
average potential of the water-oxidizing Mn complex must exceed the 
operating potential of the O2/H2O couple (approximately 870 mV at 
pH 6). Some of the individual oxidation states of the Mn complex (the 
S-states) exhibit redox potentials within 150 mV of that of P680t (Vass 
and Styring, 1991). To control possible side reactions, P680+ must be 
rereduced rapidly with minimal free energy loss. 

The immediate electron transfer to P680* occurs from a tyrosine 
residue, Yz, residue 160 in the D1 subunit of PSII, with a lifetime of 
20-40 ns under optimal conditions and a free energy drop of only 
40 mV (Meyer et al., 1989). The identifiable oxidized form of Yz is a 
neutral radical: 


YzÄOH Pt = Y,O°P + Ht 


The nature of the electron/proton coupling in this reaction is a topic of 
great current interest. Unfortunately, much of the available experimen- 
tal data comes from studies on Mn-depleted PSII preparations, which are 
incapable of water oxidation and which exhibit grossly altered kinetics 
of P680* reduction. It is clear for such preparations that abstraction of 
the proton from Yz is rate-limiting for net electron transfer, although, 
even for this system, it is not known if the proton is removed before or 
after electron transfer (Hays et al., 1999). 

In the intact, oxygen-evolving system, the oxidation of water by se- 
quential electron transfers is accompanied by H* release into the bulk 
phase. The stoichiometry of proton release depends on conditions, in- 
cluding pH. As is well established for the H* uptake by bacterial RCs, the 
released protons are presumed to include weakly coupled “Bohr” pro- 
tons, arising from pK shifts in amino acid residues under the influence 
of the charge accumulation in the Mn complex, as well as strongly cou- 
pled “chemical” protons released upon oxidation of the Mn complex, 
including associated ligands. These proton events obscure those associ- 
ated with the kinetically critical oxidation of YZOH, for which the fate 
and timing of the proton abstraction is crucial. Thus, our understanding 
of this process is largely based on thermodynamic and activation energy 
arguments. 

The basic issue is that the oxidation potential for Y,OHt/ YzOH is 
about 1.5 V in solution, too high to be an effective intermediate be- 
tween P680+ and the Mn complex. Given that YZ,OH is the dominant 
reduced form, is this reactivity problem circumvented by rapid proton 
trasfer after electron transfer or by a low level of YO” produced in a 
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preequilibrium? And can the Yz redox potential be brought down to 
earth by local means, or must the Ht be released to the bulk phase? 
Babcock and Tommos (Tommos and Babcock, 1998) consider that Ht 
from Y,OH* must be released to the bulk phase (or at least very far from 
Yz). [This is independent of the proposal that YzO° abstracts a hydrogen 
atom from the Mn complex (Tommos and Babcock, 1998), rather than 
an electron (Gilchrist et al., 1995).] 


Y,OH Pt = Y,OHtPt = Y,O°P + H* (bulk) 


Rappaportand Lavergne (2001), however, have argued that the reactivity 
of Yz is controlled by H-bond formation that results in a low level of Va" 
formation, with rapid local proton transfer preceding electron transfer: 


Y,OH Pt = WO P+ = Y,0°P 
(HF) (H*) 


In particular, they doubt that proton transfer to the bulk phase can occur 
sufficiently rapidly to account for the electron transfer kinetics (20 ns at 
pH 5), and hence favor the preequilibrium shown above. Although their 
energetic analysis is insightful, the kinetic conclusions are unwarranted 
at the present time. Proton equilibration from many protein surfaces 
into the bulk phase is known to be slow (Gopta et al., 1999; Haumann 
and Junge, 1994; Heberle et al., 1994; Maröti and Wraight, 1997) but 
whether the proton actually makes it to the bulk phase is somewhat moot 
from the point of view of the Yz reaction. There is no doubt that proton 
transfer over a sufficient distance could be effected in a few picoseconds, 
given a suitable conduction path, as evidenced by gramicidin. However, 
whether such a structure might exist must await significant improve- 
ments in the PSII X-ray structure. The basic question of timing (proton 
first or electron first) is fundamental to a mechanistic description, as it 
was for the second electron transfer to Qg in bacterial RCs, and it seems 
unlikely that it will be resolved without similar experimental input. 


F. Is Proton Transfer Design Robust? 


The charge compensation role of intraprotein proton transfer is an 
essentially dielectric response and is not structurally specific. Transfer 
of a proton requires specificity of contact, but a variety of geometries 
and protein motifs can fulfill the underlying requirement of charge re- 
distribution. Thus, many second site mutations can recover significant 
function of the first electron transfer in bacterial RCs. On the other 
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hand, efficient delivery of protons to a unique end point requires much 
greater structural specificity. In spite ofthe inherently dynamic nature of 
proteins, their packing is such that even small, neutral molecules (such 
as water) cannot penetrate deeply at will (Simonson and Perahia, 1995), 
which makes proton pathways for long-distance transport highly specific. 
In the bacterial RC, H* entry occurs at a single site and intraprotein pro- 
ton transfer occurs via a pathway that is not yet fully defined but is very 
sensitive to localized modifications. The Aspl?!? mutations that inhibit 
proton delivery to Qg” can be reverted by several second site mutations, 
but the kinetic competency is generally marginal—with the exception of 
the Asn™“4_; Asp suppressor mutation. The latter is physically very close 
by and it seems likely that Asp™“* could substitute directly for Asp'?!°, 
thereby maintaining essentially the same gross structure. Similarly spe- 
cific design features are evident in the proton channels of oxidase and 
bR. Thus it appears that the physical contact needed for proton move- 
ment toward and away from specific energy coupling sites in proteins 
can tolerate some degree of mutational change while preserving func- 
tion, but that it is generally significantly less robust that the long-distance 
energy and electron transfer functions of these proteins. 

The constraints are even more severe at each of the bond making 
and breaking catalytic sites in photosynthetic systems and their accom- 
panying redox partners. In each catalytic site (the Qg sites of the purple 
reaction centers and PSII, the quinone oxidation and reduction sites 
of bq and bsf, and in the water splitting site of PSII) multiple electron 
transfers are coordinated with multiple proton transfers. The electron 
transfer side of these reactions appears to take place over short enough 
electron tunneling distances that they are relatively facile and robust. 
The geometric and kinetic restraints of water and proton management 
around these catalytic sites, however, could very well be the principal 
engineering constraint that limits the turnover of these sites and thus 
turnover of the bioenergetic systems as a whole. Because of this sensitiv- 
ity, we might expect these sites to be subjected to the most evolutionary 
selective pressure and to be the least robust with respect to mutational 
changes. 


VII. MANAGING DIFFUSION IN PHOTOSYNTHESIS 


Although diffusion is a slow process compared to energy transfer and 
electron transfer at the shortest distances, it can be an exceptionally 
effective way to move electrons and protons over long distances. How- 
ever, unlike the hard-wired cofactor chains that guide electron transfer 
in protein complexes, diffusion faces the problem of directing where 
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the diffusing complexes go as well as the problem of avoiding un- 
wanted side reactions with unproductive partners. These engineering 
problems are addressed in the diffusing cytochrome c (or analogous 
plastocyanin) population by exploiting the protein’s surface charge dis- 
tribution near the electron transfer center to make it more likely that 
it will dock and come into tunneling distance only with the appropri- 
ate reaction partners. The same surface charge distribution also tends 
to keep the cytochrome associated with the membrane surface. This 
greatly restricts a large three-dimensional diffusion volume to an es- 
sentially two-dimensional diffusion plane, shortening the time between 
successful encounters. Encounter time is also limited by having a pool 
of cytochrome in significant excess over its redox partners. 

By virtue of its hydrophobicity, quinone is confined to the membrane, 
which similarly limits the diffusional volume. Steering charges are out 
of the question given quinone’s hydrophobic environment. However, 
it can control its reactivity with unwanted reaction partners by virtue 
of the tight coupling of the two-electron oxidation-reduction reaction 
while in the membrane. Any adventitious redox partner with the po- 
tential of transferring electrons must address the high instability of the 
semiquinone (SQ) product in the free lipid environment. The quin- 
one stability constant, [SQ]?/[Q][QHg], has been estimated as 10-10 
(Dutton et al., 2000); thus even at redox equilibrium with the quinone 
pool halfreduced and halfoxidized, the concentration ofsemiquinoneis 
small, on the order of one partin 100,000. This makes QHo a poor reduc- 
tant and Q a poor oxidant unless the quinone is in a protein site which 
is designed to either stabilize the half-reduced SQ state, or to provide 
the bound quinone with two ready and rapid electron transfer partners. 

The ultimate way to control diffusion and reactivity with redox 
partners is to restrict diffusion by anchoring a portion of the redox 
molecule and allowing essentially only one-dimensional diffusion. This 
is effectively the case for the FeS center of the ber complex, which has a 
mobile head group with a surface-exposed FeS center, but also a trans- 
membrane anchor secured to the membrane portion of the bc; com- 
plex. This severely restricts the range of motion (~16 A) but perfectly 
controls the problem of guiding electron transfer. Diffusion over this 
distance should be on the submicrosecond time scale, much faster than 
the catalytic turnover of the complex. In a certain sense, this restricted 
diffusion has properties that lie between unrestricted diffusion and fixed 
redox cofactor chains: a sort of chain with moving parts. 

By exploiting the relative distance dependences of electron tunnel- 
ing and diffusion, it seems possible that this restricted FeS diffusion 
may play a role in regulating the action of the n = 2 quinone binding 
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site, by allowing the removal of only one electron from the quinone 
site during the time interval it takes to swing from the quinone site to 
its other redox partner, fixed cytochrome cı (Darrouzet et al., 2000). 
This allows the other electron to move onto the b cytochromes and 
drive transmembrane field generating electron transfer. Such a regulat- 
ing swinging motion would be akin to the escapement mechanism of 
a clock, which regulates the drop of the weights, or the unwinding of 
the spring according to the extreme movements of the swinging pen- 
dulum. Figure 13 (see color insert) shows the essential elements of this 
action. The FeS subunit is shown in yellow and green as it revealed in 
analogous crystal structures in the presence or absence of a quinonoid 
inhibitor myxothiazol (yellow). While the anchoring transmembrane he- 
lix remains virtually unchanged, the redox active FeS cluster (red) moves 
within electron tunneling distance of one or the other redox partner, 
cytochrome cı or the quinone, but it cannot be in tunneling contact 
with both at the same time. 

These various diffusion strategies used by cytochrome c, plastocyanin, 
and quinone are clearly successful, because electron transfer throughput 
is not limited by pool action, even in tissues that need to support a 
great deal of electron transfer activity such as bee flight muscle (Suarez 
et al., 2000). These bioenergetic systems are diffusion-coupled rather 
than diffusion-limited. It seems that chemistry at catalytic sites is the 
ultimate throughput restrictor and constrains the power that can be 
produced by bioenergetic systems. 


VIII. SUMMARY 


The design of photosynthetic systems reflects the length scales of 
the fundamental physical processes. Energy transfer is rapid at the few 
angstrom scale and continues to be rapid even at the 50-A scale of the 
membrane thickness. Electron tunneling is nearly as rapid at the short- 
est distances, but becomes physiologically too slow well before 20 A. 
Diffusion, which starts out at a relatively slow nanosecond time scale, 
has the most modest slowing with distance and is physiologically compe- 
tent at all biologically relevant distances. Proton transfer always operates 
on the shortest angstrom scale. The structural consequences of these 
distance dependencies are that energy transfer networks can extend 
over large, multisubunit and multicomplex distances and take leaps of 
20 A before entering the domain of charge separating centers. Electron 
transfer systems are effectively limited to individual distances of 15 A or 
less and span the 50 A dimensions of the bioenergetic membrane by 
use of redox chains. Diffusion processes are generally used to cover the 


106 MOSER ET AL. 


intercomplex electron transfer distances of 50 Ä and greater and tend 
to compensate for the lack of directionality by restricting the diffusional 
space to the membrane or the membrane surface, and by multiplying 
the diffusing species through the use of pools. Proton transfer reactions 
act over distances larger than a few angstroms through the use of clusters 
or relays, which sometimes rely on water molecules and which may only 
be dynamically assembled. 

Proteins appear to place a premium on robustness of design, which 
is relatively easily achieved in the long-distance physical processes of en- 
ergy transfer and electron tunneling. By placing cofactors close enough, 
the physical process is relatively rapid compared to decay processes. Thus 
suboptimal conditions such as cofactor orientation, energy level, or re- 
dox potential level can be tolerated and generally do not have to be 
finely tuned. The most fragile regions of design tend to come in areas of 
complex formation and catalysis involving proton management, where 
relatively small changes in distance or mutations can lead to a dramatic 
decrease in turnover, which may already be limiting the overall speed of 
energy conversion in these proteins. Light-activated systems also face a 
challenge to robust function from the ever-present dangers of high re- 
dox potential chemistry. This can turn the protein matrix and wandering 
oxygen molecules into unintentional redox partners, which in the case 
of PSII requires the frequent, costly replacement of protein subunits. 
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I. INTRODUCTION 


Transmembrane ion pumps contain two functional components: the 
catalytic site for the driving reaction and the machinery for translocat- 
ing the transported ion from one membrane surface to the other. The 
coupling of the first to the second during the transport cycle must be, 
in the various pumps, through displacements of protein subunits, main 
chains, or side chains and bound water, which will modulate the binding 
affinity for the transported ion and the continuity of the pathway for its 
movement across the membrane. Bacteriorhodopsin, a light-driven ion 
pump in halobacteria, is a simpler system than others, because in this 
small seven-helical protein transport is driven not by a chemical reaction 
but by the free energy gained upon photoisomerization of the retinal to 
13-cis,15-anti. In this protein therefore, the ion translocation induced by 
rearrangements in the protein can be studied without complications of 
the driving reaction. 

Over the years, a very large amount of static and time-resolved spec- 
troscopy of various kinds, as well as studies of site-specific mutants of 
bacteriorhodopsin, has generated kinetic models for the transport cy- 
cle (“photocycle”) and identified the side chains of importance (Haupts 
et al., 1999; Lanyi and Varo, 1995; Oesterhelt, 1998). The results had 
begun to identify the molecular events that underlie the interconver- 
sions of the spectroscopically distinct intermediate states termed J, K, 
L, M, N, and O. Together with low-resolution 3D maps of the protein 
and some of the intermediate states from cryoelectron microscopy of 2D 
crystals, these results suggested the beginnings of a mechanistic model 
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(Herzfeld and Tounge, 2000; Lanyi, 1998; Luecke, 2000; Subramaniam 
and Henderson, 2000a). In these models the Schiff base region under- 
goes local changes in the K to L reaction in response to the retinal 
isomerization, and the assumption was that these changes cause direct 
proton transfer from the retinal Schiff base to Asp-85 in the L to M 
reaction. The latter in turn brings about the release of a proton to the 
extracellular surface. It was generally held that the directionality of the 
transport originates from the switch of the connectivity of the unproto- 
nated Schiff base in what was described as the Mi to Mg reaction, from the 
extracellular to the cytoplasmic side, allowing Asp-96 to reprotonate the 
retinylidene nitrogen in the Mg to N reaction. Concerted reprotonation 
of Asp-96 from the cytoplasmic surface and reisomerization of the reti- 
nal to all-trans occurs in the N to O transition, followed by recovery 
of the initial state as the O state decays in a strongly unidirectional 
reaction. 

More recently, high-resolution crystal structures of the unilluminated 
state of the protein (Belrhali et al., 1999; Luecke et al., 1999a) and photo- 
intermediates (Edman et al., 1999; Facciotti et al., 2001; Luecke et al., 
1998, 1999b, 2000b; Royant et al., 2000) from three-dimensional crystals 
grown in cubic lipid phase supplied a molecular rationale to this mech- 
anism and identified the way the retinal interacts with the protein and 
causes its conformational changes. The structures revealed a highly po- 
larized water molecule hydrogen bonded to the positively charged Schiff 
base and to a pair of negatively charged aspartates, Asp-85 and Asp-212, 
and the presence of a hydrogen-bonded network of polar residues and 
bound water in the extracellular region. Coupling of the protonation 
state of Asp-85 to the protonation state of the as yet unidentified group 
that releases a proton to the extracellular surface is through the shut- 
tling of the Arg-82 side chain between two alternative positions, inward 
and outward. Initially, the cytoplasmic region lacks the means to conduct 
an ion, and the pK, of the proton donor Asp-96 is high. Lowering this 
pKa and building a hydrogen-bonded chain to the Schiff base is accom- 
plished by repacking of side chains between helices F and G and the 
entry of water molecules, through the relaxation of the polyene retinal 
chain, which pushes the 13-methyl group against the indole of Trp-182 
and distorts the side chain of Lys-216. 

The new structural insights have also posed new problems. One of 
these concerns the first and critical deprotonation/protonation event 
that involves the Schiff base and Asp-85. What is the direction of the Schiff 
base N-H bond in L, i.e., after the light-induced isomerization of the reti- 
nal but before its deprotonation and the protonation of Asp-85? Does its 
orientation allow direct proton transfer from the Schiff base to Asp-85, 
or does it now point toward the cytoplasmic direction as expected from 
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the unstrained configuration of 13-cis,15-antiretinal? An alternative to di- 
rect proton transfer would be the dissociation of the water molecule that 
bridges the Schiff base and Asp-85. In this case the transiently formed 
hydroxyl anion would be moving to the cytoplasmic side rather than 
a proton to the extracellular side (Luecke, 2000). Questions have also 
arisen about the conformational changes in the cytoplasmic region that 
first allow proton exchange between the Schiff base and Asp-96 with- 
out communication with the bulk, but then allow the reprotonation of 
Asp-96 from the cytoplasmic surface. Is there a single conformational 
change in the photocycle (Subramaniam and Henderson, 2000a), or 
are there several successive (Balashov, 2000; Dioumaev et al., 2001) con- 
formations? Are these transient conformations similar to the altered 
structures assumed by various mutants even without illumination? 

This review will summarize and critically evaluate the crystallographic 
data for bacteriorhodopsin and its photointermediates. We will attempt 
to correlate the structural with the nonstructural data in order to explore 
the various and often contradictory mechanistic conclusions drawn. 

Currently, the Protein Data Bank (PDB; Berman et al., 2000) con- 
tains 33 atomic coordinate entries of bacteriorhodopsin structures, an 
increase of over 50% from 2 years earlier, when there were 20 entries 
(Luecke, 2000). This large number of models attests to the fact that bac- 
teriorhodopsin is one of the most-studied and best-understood integral 
membrane proteins. Three of these coordinate entries are theoretical 
models (1BAC, 1BAD, 1115) and three are NMR structures of fragments 
(1BCT, 1BHA, 1BHB), which will not be discussed further in this re- 
view. Of the remaining 27 entries (Table I), six are electron diffraction 
structures [lBRD, 2BRD (which is mislabeled as an X-ray diffraction 
entry), LAT9, 2AT9, 1FBB, 1FBK], with the first two representing the 
pioneering work by Henderson and co-workers that produced the first 
atomic-level models of bacteriorhodopsin at 3.5 Ä resolution (Grigorieff 
et al., 1996; Henderson et al., 1990). The remaining 21 entries are X-ray 
crystallographic structures of wild type and mutants in various states and 
to varying resolutions (almost double the 11 X-ray models 2 years ago!). 
Ten of these describe the ground (light-adapted or resting) state (1AP9, 
1BM1, 1BRR, 1BRX, 1C3W, 1C8R, 1F50, 1KGB, 1OH]J, 1OM8). An addi- 
tional eight models represent various cryotrapped intermediates: two for 
a low-temperature K intermediate (QKO/1QKP), one for an L interme- 
diate (1EOP), and no fewer than five models describe the structures of 
various M intermediates (1C8S, late M from D96N mutant; 1F4Z, early 
M from E204Q mutant; 1CWQ, M from wild type; 1DZE M, M from 
wild type; 1KG8, early M from wild type). A set of two recent structures 
describes bacteriorhodopsin mutants that function as light-driven chlo- 
ride pumps: the D85S single-site mutant (1JV7) and the D85S/F219L 
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TABLE I 


Electron Diffraction and X-Ray Diffraction Models of Bacteriorhodopsin Currently Deposited in 
the Protein Data Bank“? 








PDB code Description Resolution (A) Method 

1BRD First atomic bacteriorhodopsin 3.5 ED 
structure 

2BRD Refinement of IBRD 3.5 ED 

1AT9 Wild type 3.0 ED 

2AT9 Refinement of 1AT9 3.0 ED 

1FBB Native, based on 2BRD 3.2 ED 

1FBK Triple mutant with 3.2 ED 
cytoplasmically open 
conformation 

1AP9° First CLP structure, merohedral 2.35 X-ray, CLP 
twinning not recognized 

1BRX Wild type, ground state, 2.3 X-ray, CLP 
merohedral twinning 

1BM1 Wild type, ground state 3.5 X-ray, via spherical 

vesicles 

1BRR Wild type, ground state, trimer 2.9 X-ray, detergent 
in the asymmetric unit 

1AP9 Further refinement of 1AP9,° 2.35 X-ray, CLP 
taking twinning into account 

1QHJ Wild type, ground state 1.9 X-ray, CLP 

1C3W Wild type, ground state, 1.55 X-ray, CLP 
twinning 

1C8R D96N mutant, ground state 1.8 X-ray, CLP 

1C8S D96N mutant, late M intermediate 2.0 X-ray, CLP 
with 100% occupancy 

1QKO/1QKP Wild type, low temperature 2.1 X-ray, CLP 
K intermediate with 35 % 
occupancy 

1CWQ Wild type, mixture of early & 2.25 X-ray, CLP 
late M, and N intermediates with 
35% occupancy, twinning 

1QM8 Based on 1BM1, wild type, 2.5 X-ray, via spherical 
ground state vesicles 

IDZE* Wild type, M intermediate with 2.5 X-ray, via spherical 
100% occupancy vesicles 

1EOP* Wild type, L intermediate with 36 % 2.1 X-ray, CLP 
occupancy, twinning 

1F50 E204Q mutant, ground state 1.7 X-ray, CLP 

1F4Z E204Q mutant, early 1.8 X-ray, CLP 


M intermediate with 100% 
occupancy 





(Continued) 
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TABLE I (Continued ) 








PDB code Description Resolution (A) Method 
1JV7 D85S mutant without halide, 2.25 X-ray, CLP 
O-like state, head-to-tail 
dimers 
1JV6 D85S/F219L double mutant 2.0 X-ray, CLP 


without halide, O-like state, 
head-to-tail dimers 


1KGB Wild type, ground state 1.65 X-ray, CLP 

1KG8 Wild type, early M intermediate with 2.0 X-ray, CLP 
100% occupancy 

1KME Wild type, crystallized from 2.0 X-ray, from 
bicelles, head-to-tail dimers bicelles 


“In order of publication in the respective category. 

’ Cryo-trapped intermediates are rendered in italics. 

€ As published (Pebay-Peyroula et al., 1997). 

4No publication since deposition in February 2000, when the title was “Sliding of G- 
helix in bacteriorhodopsin during proton transport.” 

"For which the original publication and the PDB file specify 70% L occupancy, an 
estimate that was later reduced to 36% by the same group, along with 12% K and 12% 
M contamination. 

SED, electron diffraction; CLP, cubic lipid phase. 


double mutant (1JV6), both of which were determined in the absence 
of halides and show a significantly opened extracellular half channel 
with respect to the other BR structures (Fig. 1, see color insert; Facciotti 
et al., 2001). Because these mutants contain an uncharged residue at 
position 85 as well as all-trans retinal they are thought to reveal a glimpse 
of the O intermediate of the photocycle. Finally, there is one additional 
entry being processed at PDB (1KME) with the title “Crystal structure 
of bacteriorhodopsin crystallized from bicelles” which describes a novel 
third approach to crystallizing membrane proteins (Faham and Bowie, 
2002), distinct from vapor diffusion (Michel, 1991) and the cubic lipid 
phase method (Landau and Rosenbusch, 1996). 

Even though this proliferation of structures has prompted some to 
call bacteriorhodopsin the “lysozyme of membrane proteins,” BR does 
not yet present a real challenge to X-ray-derived lysozyme models in the 
database, which currently number 680! 


I. THE GROUND, OR RESTING, STATE 


The ground state of bacteriorhodopsin, also called the initial, light- 
adapted, BR, or resting state, refers to the protein with a relatively 
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relaxed, protonated all-trans retinal and is characterized by an ab- 
sorption maximum of 568 nm. Its 3D structure was first reported 
by the ground-breaking electron diffraction work on 2D crystals of 
Henderson and co-workers at 7 Ä resolution (Henderson and Unwin, 
1975). Subsequently, the same group improved the resolution to 3.5 Ä, 
which resulted in the first atomic model of an ion pump (Grigorieff et al., 
1996; Henderson et al., 1990). Meanwhile, in an effort to complement 
the findings of intense spectroscopic research aimed at elucidating the 
key functional groups and the kinetics of the photocycle, many groups 
had been working hard to obtain well-diffracting 3D crystals for X-ray 
diffraction experiments. A breakthrough came with the development of 
a novel method for the crystallization of membrane proteins, the cubic 
lipid phase (CLP) method (Landau and Rosenbusch, 1996). 

Although the first bacteriorhodopsin structure reported using this 
new method suffered from severe merohedral twinning that was not ac- 
counted for (Pebay-Peyroula et al., 1997), the recent explosion of struc- 
tural knowledge for bacteriorhodopsin clearly has been driven by this 
new method (Table I). This method was also employed in the determi- 
nation of the high-resolution structures of two related integral mem- 
brane proteins: halorhodopsin, a light-driven chloride pump (Kolbe 
et al., 2000), and sensory rhodopsin II, a photoreceptor (Luecke et al., 
2001; Royant et al., 2001a). 

With merohedral twinning taken into account, crystals from CLP al- 
lowed the first localization of key waters, including highly polarized 
water 402 between the positively charged Schiff base and two negatively 
charged aspartic acids, Asp-85 and Asp-212 (Luecke et al., 1998). More 
recently, crystals from CLP have yielded very high resolution structures 
that describe accurately the conformations of the retinal, the main chain 
and side chains, as well as the locations of all ordered internal waters 
(Belrhali et al., 1999; Luecke et al., 1999a). 

In the extracellular half-channel an extensive three-dimensional 
hydrogen-bonded network of protein residues and seven water 
molecules leads from the buried, positively charged retinal Schiff base 
via water 402, a pair of negatively charged aspartic acids (Asp-85 and 
Asp-212), positively charged Arg-82, and finally a pair of glutamic acids 
(Glu-194 and Glu-204) to the extracellular surface. In the ground state 
the long, flexible guanidinium side chain of Arg-82 is in a position 
roughly midway between the two aspartates above it (Asp-85 and 
Asp-212) and the two glutamates below it (Glu-194 and Glu-204) (Fig. 2, 
see color insert; Luecke et al., 1999a). The proton that is released to the 
extracellular surface after the Schiff base has deprotonated and Asp-85 
has protonated in the L to M transition is thought to initially reside 
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either on the pair of glutamic acids (Essen et al., 1998) or on a network 
of waters in their vicinity, collectively called the proton release group 
(Spassov et al., 2001). 

Near Lys-216, to which the retinal is covalently attached, transmem- 
brane helix G contains a x-bulge that causes a nonproline kink in 
the helix. This bulge causes the peptide plane between Ala-215 and 
Lys-216 to tilt away from the helix axis, and locally disrupts the a-helical 
hydrogen-bonding pattern. In the ground state this x-bulge is stabilized 
by hydrogen bonding of the main-chain carbonyl groups of Ala-215 and 
Lys-216 with two buried water molecules located in the otherwise very 
hydrophobic region between the Schiff base and Asp-96 in the cytoplas- 
mic region. The water that hydrogen bonds to the C=O of Ala-215 in 
turn accepts a hydrogen bond from the indole nitrogen of Trp-182, a 
residue in van der Waals contact with the polyene chain of the retinal, 
in particular with the C-13 methyl group (Luecke et al., 1999a). 

Asp-96, a key residue in the middle of the relatively hydrophobic cy- 
toplasmic half-channel about 11 Å from the protonated Schiff base, is 
protonated in the ground state. Its only polar interaction stems from a 
hydrogen bond to the hydroxyl of Thr-46. Otherwise it is surrounded 
by a hydrophobic barrel whose walls are formed by residues Ile-45, Leu- 
223, and Leu-224, and the lids by Phe-42, Leu-99, and Leu-100 on the 
cytoplasmic side, and by Val-49, Leu-93, and Phe-219 Schiff base side. In 
the M to N transition of the photocycle, reprotonation of the Schiff base 
from Asp-96 can occur only after this hydrophobic region between the 
Schiff base and Asp-96 has been populated by a chain of water molecules 
(Luecke et al., 2000b). 

Between the molecules 18 tails of ordered lipids were identified, con- 
stituting near-complete bilayers in the 3D crystals. During analysis a sur- 
prising amount of shape complementarity between the hydrophobic 
lipid tails and the membrane-embedded hydrophobic surface of bacte- 
riorhodopsin became evident (Fig. 3, see color insert). This specificity of 
protein-lipid interactions contradicts older views that bilayer lipids sim- 
ply provide a hydrophobic fluid compartment for integral membrane 
proteins. Each layer of the hexagonal 3D crystals has the same arrange- 
ment as the naturally occurring 2D crystals of bacteriorhodopsin, called 
“purple membrane,” and the photocycle has been shown to proceed 
with similar kinetics in the 3D crystals (Heberle et al., 1998). Additional 
evidence for the importance of specific lipids for protein-lipid con- 
tacts comes from the observation that highly purified bacteriorhodopsin 
preparations fail to crystallize in the cubic lipid phase (unpublished re- 
sults). Our interpretation is that native lipids are stripped away during 
excessive purification, lipids that are essential in forming the highly 
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ordered 2D layers in the CLP 3D crystals. Furthermore, a squalene 
molecule was identified in the middle of the bilayer near the z-bulge of 
helix G (Luecke et al., 1999a), providing a structural angle to reports that 
squalene is present in purple membrane and that it affects the kinetics 
of the photocycle (Joshi et al., 1998). A review by Hendler and Dracheva 
(2000) favors a location of the entirely hydrophobic squalene at the 
cytoplasmic surface near Asp-36 and Asp-38. Although the photocycle 
in the crystals is only slightly perturbed, the review mentions the possi- 
bility that detergent extraction and CLP crystallization in the presence 
of nonnative lipids (monoolein) might give rise to perturbations of the 
native protein-lipid arrangement. 

Recent improvements in the details of the CLP crystallization proce- 
dure and crystal treatment have led to even better diffracting crystals. At 
1.4 A resolution, refinement, including anisotropic B factors, illustrates, 
the thermal motion anisotropy of the retinal and key neighbors (Fig. 4, 
see color insert; manuscript in preparation). 


III. EARLY PHOTOCYCLE INTERMEDIATES (K AND L) 


Within a few picoseconds after absorption of a photon, which de- 
posits about 50 kcal/mol of energy into the retinal, and with a quan- 
tum yield of about 67% (Govindjee et al., 1990), the all-trans retinal of 
the ground state is isomerized to a strained 13-cis,15-anti configuration, 
with the strain giving rise to large-amplitude hydrogen-out-of-plane vi- 
brations (Braiman and Mathies, 1982; Rothschild et al., 1986; Siebert and 
Mantele, 1983). The resulting K intermediate displays a slightly red- 
shifted absorption maximum (Amax = 590 nm). The K intermediate has 
a reported AH of 11.6 kcal/mol (Birge et al., 1991), thus about 20% of 
the photon energy is converted to enthalpy. At room temperature the 
pump cycle proceeds from the K intermediate without further energy 
input, i.e., thermally. In contrast, at or below 100 K motions of the pro- 
tein and internal waters are severely restricted, arresting the protein in 
the K intermediate (cryotrapping) even after illumination has ceased. 
However, because of the large spectral overlap between the ground state 
and the K intermediate (Amax difference 22 nm), it is not possible to 
achieve K intermediate occupancies higher than about 50% (Balashov 
et al., 1991; Balashov and Ebrey, 2001). With cryotrapping there is also al- 
ways the possibility of trapping unnatural intermediates, in other words, 
conformations that do not normally occur during the room-temperature 
photocycle (Balashov and Ebrey, 2001). 

In 1999 Landau and co-workers published a set of 2.1 A structures 
for a 35%-occupied, low-temperature K intermediate (Edman et al., 


THE MECHANISM OF ION PUMPING IN BACTERIORHODOPSIN 119 


1999), which, for unknown reasons, required continuous green light 
illumination during X-ray diffraction data collection. The main finding 
was that “a key water molecule [water 402] is dislocated, allowing the 
primary proton acceptor, Asp 85, to move.” Other groups, including 
ours, have also attempted to obtain high-resolution structures of the K 
intermediate, but up to now decided that meaningful interpretation was 
too difficult because of the combination of limited resolution, twinning, 
and less than 50% occupancy, compounded by the nearly complete spa- 
tial overlap of all-trans and 13-cis,15-anti retinal (Fig. 5, see color insert). 
Furthermore, despite Raman and FTIR evidence of significant strain 
in the K intermediate (Braiman and Mathies, 1982; Maeda et al., 1991; 
Rothschild et al., 1986; Siebert and Mantele, 1983), the retinals in the 
published K models were forced into planar (i.e., nonstrained) all-trans 
and 13-cis,15-anti configurations as described in the Methods section of 
Edman et al. (1999): 


At 2.1 A resolution, the two closely overlapping retinal configurations (all-trans,15-anti 
and 13-cis,15-anti) could not be refined free from bias resulting from stereochemical 
constraints. The retinal backbone and Schiff base linkage up to Ce of Lys 216 were con- 
strained to be planar for both the all-trans, 15-anti and the 13-cis,15-anti configurations. 


The structure of the L intermediate is arguably the most interesting 
one. In the L state, the strain of the retinal has relaxed considerably, and 
the active site is now primed for the decisive event in the photocycle, the 
protonation of Asp-85, coupled to the deprotonation of the Schiff base 
(L to M reaction). The key question in this point is the orientation of 
the N-H bond (charge dipole) of the protonated Schiff base just before 
it gives up its proton. Does the N-H point toward the hydrophobic 
cytoplasmic side, as a relaxed 13-cis,15-anti configuration would suggest? 
And if this were the case, which entity would be the acceptor of the 
Schiff base proton? Or is there still considerable strain in the retinal 
(particularly around the C13=C14, C14—C15; and C;;=NZ bonds), enough 
to keep the orientation of the N-H bond toward the extracellular side, 
in the same direction that it pointed to in the ground state? More details 
regarding this issue can be found in Luecke (2000). 

Only a high-resolution structural method that overcomes the difficul- 
ties of deconvoluting multiple, spatially highly overlapping structures 
(Fig. 4) is likely to be able to accurately answer this question. The 
spectral overlap problem is a principal one since even under ideal 
conditions it is not possible to achieve occupancies of the K or L 
intermediate higher than about 50% and 70%, respectively, because of 
the extensive spectral overlap between the ground state and both the 
K and the L intermediate (Amax differences of +22 nm and —28 nm, 
respectively) (Balashov and Ebrey, 2001). 
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Landau and co-workers reported a merohedrally twinned 2.1 Ä struc- 
ture, claimed to be of a 70%-occupied, low-temperature L intermediate 
(Royant et al., 2000). The claimed L-intermediate occupancy was later 
corrected to 36%, in a mixture with 40% ground state, 12% Kand 12% 
M intermediate (Royant et al., 2001b). The main interpretation of this 
structure was that 


a bend of this helix [helix C] enables the negatively charged primary proton acceptor, 
Asp 85, to approach closer to the positively charged primary proton donor, the Schiff 
base. The primary proton transfer event would then neutralize these two groups, cancel- 
ing their electrostatic attraction and facilitating a relaxation of helix C to a less strained 
geometry. 


In other words, an approach of Asp-85 toward the Schiff base was re- 
ported, presumably resulting in direct proton transfer of the Schiff base 
proton to Asp-85. Even with a decreased donor-acceptor distance, for 
this proton transfer to occur, the Schiff base N-H bond would have to 
point toward Asp-85, i.e., mostly in the extracellular direction. This in 
turn would require severe rotational distortions about one or more of 
the three bonds between Cj3 and NZ (as discussed above) since the reti- 
nal is in a 13-cis,15-anti configuration at this point. However, the retinal 
of this L structure (PDB entry 1EOP), like the retinals of the K structures, 
appears to be in a planar, relaxed 13-cis,15-anti configuration, with the 
Schiff base N-H bond pointing toward the hydrophobic cytoplasmic 
side, and thus maximally away from the acceptor, Asp-85. Unlike in the 
K paper, the L paper contains no information regarding retinal planarity 
restraints during refinement. 

Furthermore, the distances measured for PDB entry 1EOP do not ap- 
pear to support the claim that a bending of helix C causes an approach 
Oe, decrease in distance) of Asp-85 toward the Schiff base. The distance 
from the NZ of the all-trans (i.e., ground state) retinal to the Cy of Asp-85 
of the ground state is 6.37 A, whereas the same pair in the L interme- 
diate structure is 6.52 A apart. This relative increase does not take into 
account the extra increase one would obtain if one were to measure the 
distance from Asp-85 to the Schiff base N-H hydrogen instead of the 
NZ nitrogen. Because of the reorientation of the N-H bond this would 
further increase the distance by up to 2 A. 

There remain concerns about the actual amount of trapped L in the 
studies, based in part on the published visible spectra of the crystals 
(Balashov and Ebrey, 2001). In particular, the contamination by the M in- 
termediate (deprotonated Schiff base, Amax near 410 nm, at least 12% 
of sample) would tend to dominate the changes in diffraction intensi- 
ties due to the relatively larger structural changes of the M and later 
photocycle intermediates. In this case difference electron density maps 
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would reveal a convolution of features of the K, L, and M states. Further- 
more, it is now clear that the determination of the partial occupancy of 
the L state by crystallographic refinement alone, which yielded the ini- 
tial 70% value (Royant et al., 2000), against a background of structurally 
highly overlapping states (BR, K, M) at medium resolution and in the 
presence of merohedral twinning led to an overestimation by a factor of 
at least 2, as the authors themselves have acknowledged (Royant et al., 
2001b). 


IV. M INTERMEDIATES 


The first photocycle intermediate to be determined exploited the fact 
that decay of the M intermediate is dramatically slowed down (Tip of 
several seconds versus tens of microseconds) when the proton donor 
Asp-96 in the cytoplasmic half-channel is replaced with asparagine, a 
nonionizable analogue (D96N mutant; Luecke et al., 1999b; Otto et al., 
1989; Tittor et al., 1989). In this mutant the Schiff base eventually repro- 
tonates from the distant cytoplasmic surface for lack of a nearby proton 
donor. Illumination of the D96N mutant crystals with red or yellow light 
at ambient temperature caused the initially deep purple (ground state) 
crystals to turn light yellow in less than 1 s. Once full conversion was 
obtained, the crystals were flash-cooled in the 100 K cryostream for data 
collection, yielding a fully occupied (i.e., full depletion of ground state 
absorption at 570 nm) cryotrapped M intermediate that persists even 
after the illumination is turned off (Luecke et al., 1999b). In contrast, 
turning off the illumination at ambient temperature will cause the crys- 
tals to become deep purple again in a few seconds, indicating thermal 
completion of the photocycle to the ground state. 

When working with any mutant, even in the case of a conservative 
single-residue replacement, there is always a possibility of changes due 
to the mutation, in particular changes to the local water structure. For 
that reason, a second single-site mutant with slow M decay was also crys- 
tallized and investigated, but this time the mutation was located on the 
extracellular side. The mutation of Glu-204, which is part of the proton 
release group, to a glutamine (E204Q mutant; Brown et al., 1995; Luecke 
et al., 2000b) results in markedly slower M decay in the 3D crystals, but, 
for reasons not understood, displays slow O decay in suspension. The 
M structure obtained from this mutant represents an M earlier in the 
photocycle than that obtained from the D9I6N mutant, presumably 
because the E204Q mutation uncouples the protonation of Asp-85 from 
proton release to the extracellular surface, a step that in the native pho- 
tocycle occurs prior to the reprotonation of the Schiff base from Asp-96. 


122 LUECKE AND LANYI 


To exclude possible interference from mutations, various other groups 
used wild-type protein, yielding M intermediate occupancies ranging 
from 35% to 100%, depending on the illumination and cryotrapping 
protocol. Illumination at room temperature followed by rapid cool- 
ing resulted in a mixture of ground state (35%), early M (30%), late 
M (35%), and some N (not specified) (Sass et al., 2000). A different 
protocol first cooled the wild-type crystals to 100 K, then increased the 
temperature to 230 K, followed by 3 min of illumination and subsequent 
cooling back to 100 K for data collection (Facciotti et al., 2001). A third 
PDB entry deposited 2 years ago is as yet without an accompanying pa- 
per and cannot be evaluated for lack of information (PDB entry 1DZE, 
Kouyama et al.). 

All five structures of M intermediates (Table I) emphasize the crucial 
role of discrete internal water molecules in the pump cycle. In all of these 
structures, water 402, which in the ground state is bridging the proto- 
nated Schiff base and the two nearby anionic aspartates, is no longer 
detectable. In several structures, additional waters not observed in the 
ground state were detected in the cytoplasmic half-channel, extending 
from the proton donor during the M to N transition, Asp-96, toward the 
deprotonated Schiff base. In addition, one of these studies describes a 


crystallographically unusual water molecule, W740q, which is surrounded by mostly 
hydrophobic residues (Thr 170, Phe 42, Leu 100, Leu 223, Ile 229) without hydrogen- 
bonding partners in its vicinity 


(Sass et al., 2000). This water is located between Asp-96 and the cytoplas- 
mic surface and the authors suggest 


that the contact to Asp 96 might be achieved in the millisecond time domain by a 
fluctuating water molecule and not by a continuous water channel. 


Near the Schiff base, the protein begins to react to the kink in the reti- 
nal caused by photoisomerization about the C}3=C14 double bond. This 
allows progressive upward movement of the Cı3-methyl group in the di- 
rection of the nearby indole moiety of Trp-182. Early M structures (1F4Z, 
1KG8) show less displacement and a limited effect on the Trp-182 side 
chain, whereas late M structures (1C8S, 1CWO) show a 1.5 A movement 
of the C-13 methyl group with a concomitant displacement of the indole 
by over 1 A, leading to the disruption of the water bridge (water 502) 
between the Trp-182 indole nitrogen and the C=O of Ala-215 of helix G 
(Luecke et al., 1999b, 2000b). 

The early part of the M phase is dominated by reactions in the extra- 
cellular region in response to the electrostatic changes at the Schiff base. 
The titration behavior of Asp-85 revealed that there is coupling between 
the protonation state of this aspartate and another protonatable group 
(Govindjee et al., 1996). All five M structures show that the means of this 
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coupling is the shuttling of the positive charge of Arg-82 from a ground 
state position roughly midway between the two pairs Asp-85/ Asp-212 
and Glul94/Glu204 toward Glul94/Glu204 in M, when Asp-85 is pro- 
tonated. Protonation of Asp-85 greatly diminishes its electrostatic in- 
teraction with the positively charged guanidinium of Arg-82, which is 
now free to approach the region of Glu-194 and Glu-204, the two acidic 
residues implicated, directly or indirectly, in proton release. The ap- 
proach of a positive charge is thought to stabilize the deprotonated 
from of the proton release group, leading to proton release at the extra- 
cellular surface. This type of long-range electrostatic coupling (in this 
case over a distance of 14 A) could be a general mechanism in sys- 
tems in which conformational or pK changes need to be propagated 
rapidly. 


V. LARGE-SCALE CONFORMATIONAL CHANGES IN THE M, N, 
AND O INTERMEDIATES 


As discussed above, crystallographic structures have been reported for 
the M state of the wild-type protein at ambient temperature and at 230 K, 
as well as in the D96N and E204Q mutants. Because the overlap between 
the spectra of the M state and the unilluminated state is small, and in all 
cases listed but the first the decay of M is slow, the occupancy in photo- 
stationary states is nearly 100%. This is a unique advantage and makes 
the structures of M the best defined of all the intermediates. The results 
confirm the earlier hints from projection maps (Dencher et al., 1989; 
Han et al., 1994; Nakasako et al., 1991) that the largest changes involve 
helices F and G. The various M states studied are not equivalent. The M 
produced at 230 K might correspond approximately to the earliest M, 
the M; state that is in equilibrium with L. The E204Q mutation inhibits 
proton release to the extracellular surface during the rise of the M state, 
but the D96N mutation inhibits the reprotonation of the Schiff base and 
thus M decay. For this reason, the M intermediates in these mutants were 
assumed to correspond to an “early” (but later than the earliest M) and 
a “late” M state, respectively, in the wild-type photocycle. Indeed, there 
are progressive changes in the three structures. In the earliest M there is 
little change in the positions of any of the helices (Facciotti et al., 2001). 
In the M presumed to follow it, the cytoplasmic end of helix F, from 
approximately residue 182, tilts outward by about 0.7 A. In the late M 
the cytoplasmic end of helix F, from residue 177, is disordered (Luecke 
et al., 1999b), presumably because the magnitude of the tilt is greater 
and locally disrupts crystal contacts. Helix G also undergoes progressive 
changes, locally at the z-bulge. The kink in helix G associated with the 
z-bulge at Lys-216 becomes less extreme as its connection with helix 
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F through the hydrogen bond of the C=O of Ala-215 to water 501 and 
from there to the indole N of Trp-182 is lost. 

The causes of these changes are (1) the relaxation of the polyene 
chain of the photoisomerized 13-cis,15-anti retinal, constrained into an 
extended form up to this time in its binding site, that moves the 13-methyl 
group against the indole ring of Trp-182, and (2) the displacements of 
the Lys-216 side chain. These changes are more pronounced in the late 
M than in the earlier M. The coordinated movement of the chain of 
covalent and hydrogen bonds linking Trp-182 to the main chain of helix 
G and through water 502 to helix C and thus Asp-96, and the repacking 
of the side chains between helices F and G that ensue are related to the 
entry of two additional water molecules into the cytoplasmic region. In 
the M state of the E204Q mutant (where this region is not changed by 
the mutation), water 504 is now interposed between Asp-96 and Thr-46. 
Water 503 forms hydrogen bonds with water 502 and 504, as well with 
the C=O of Lys-216. 

There is very little direct information about the crystallographic struc- 
tures of the N and O states. Projection maps of M and N, in the wild 
type (for M), D96N (for M), or F171C (for N), exhibit the same kind 
of electron density changes at helices C, G, and F (Subramaniam and 
Henderson, 2000b). Low-resolution electron microscopy of tilted two- 
dimensional crystals of the F219L mutant with long-living N confirmed 
that there is a considerable outward tilt of the cytoplasmic end of he- 
lix F (Vonck, 2000). With occupancies greater than can be achieved for 
the N intermediate (about 30%) whose spectrum strongly overlaps the 
spectrum of the unilluminated state, the tilt should create steric con- 
flicts between trimers. This is likely to be the case for the M state of the 
D96N mutant, where it may be the cause of the local disorder in the 
three-dimensional crystals at helix F. 

A novel approach that attempts to overcome the problems of both low 
and high occupancy is to search for mutations that, for one reason or 
another, assume stable structures similar to the transient photocycle in- 
termediates. Many residue replacements in the cytoplasmic region cause 
drastic slowing of the reisomerization of the retinal and/or protonation 
reactions in the second half of the photocycle. When three of these muta- 
tions were introduced together into the protein (D96G/F171C/F219L), 
helix F was seen to tilt outward as in the M and N states, but without illu- 
mination (Subramaniam and Henderson, 2000a,b). There was no addi- 
tional structural change in the photocycle of this mutant. It appears that 
when some degree of stabilization conferred on it by the mutations, this 
M-like (or N-like) conformation represents a second energy minimum 
to the normal bacteriorhodopsin conformation. 
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Changing the ionization state of key residues in the protein to resem- 
ble their states in the photocycle is another approach to simulating struc- 
tural changes in intermediates. Mutants in which Asp-85 is replaced bya 
nonionizable residue, such as asparagine or serine, exist as three species 
in pH dependent equilibrium (Turner et al., 1993). Their absorption 
maxima resemble those of the M, N, and O states. The yellow, M-like 
state, at pH > 9, contains deprotonated retinal Schiff base, and the den- 
sity changes at helices F and G characteristic of M are indeed evident 
in projection X-ray maps of unilluminated samples (Brown et al., 1997; 
Vonck, 2000). The purple, N-like state, at pH between 7 and 9, con- 
tains protonated Schiff base but deprotonated Asp-96, and the retinal is 
13-cis,15-anti even without illumination (Dioumaev etal., 1998) . It resem- 
bles the N state in its changed infrared amide bands. The blue, O-like 
state, at pH < 7, contains all-trans retinal and protonated Asp-96. The 
structure of this state, from the D85S mutant, has been solved to 2.25 A 
resolution (Rouhani et al., 2001). Its global conformation, induced evi- 
dently by missing the negative charge from the Schiff base region from 
replacement of Asp-85 with a neutral residue, is considerably different 
from both the unilluminated state and the M and N intermediates. 

In the O-like state the extracellular ends of helices A, B, C, and D are 
tilted outward, but their cytoplasmic ends are not displaced. Helix E is 
tilted also, but around a pivot point near its middle, so its extracellular 
and cytoplasmic ends are displaced outward and inward, respectively. If 
this structure is indeed like that of the O state, the implication is that 
the protein undergoes a scissoring motion in the second half of the 
photocycle. It begins with a splaying of the cytoplasmic side of the seven 
helical bundle in M, which continues in N but reverses in O and opens the 
extracellular cavity instead. These suggested large-scale global motions 
are in sharp contrast with the relatively small (1-2 A) and more local 
atomic displacements in the first half of the photocycle. The rationale 
must be that the structure of the protein in the unilluminated state 
predisposes it to the early reactions in the cycle, but the later reactions 
require drastically different conformations. 


VI. PROTONATION PATHWAYS IN THE M TO N 
AND THE N TO O REACTIONS 


In the second half of the photocycle the Schiff base is reprotonated 
in a protonation equilibrium with Asp-96, and this is followed by repro- 
tonation of Asp-96 from the cytoplasmic surface. Unlike in the extra- 
cellular region, where a three-dimensional hydrogen-bonded network 
already exists to facilitate the earlier events, involving transfer of protons 
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(or hydroxyl ions), in the photocycle, the cytoplasmic region contains no 
such features. Lacking polar side chains or coordinated water molecules 
to conduct protons, this region will have to rely on the entry and specific 
binding of water to provide a pathway for ion transfers, as well as to lower 
the initially very high pKa of Asp-96. How would large-scale conforma- 
tional changes in N and O accomplish this? 

At pH > 8 the kinetic relationship of M, N, and O is simpler than at 
lower pH and can be described with the scheme M <> N —> O. The pro- 
tonation equilibrium of the Schiff base with Asp-96, the M <> N reaction, 
is pH independent consistent with internal proton transfer, but the re- 
protonation of Asp-96, the N —> O reaction, is pH dependent consistent 
with proton uptake from the bulk. Thus, one should expect a two-stage 
conformational shift, to enable ion transfer first between the Schiff base 
and Asp-96, and then between Asp-96 and the cytoplasmic surface. 

The network of hydrogen-bonded water in M extends from Asp-96 to- 
ward the Schiff base but does not reach it (Luecke et al., 2000b). It is likely 
that the rise of the N state depends on completing the proton transfer 
pathway by several more water additional molecules. The large influence 
of osmotic agents (Cao et al., 1991) on the M > N equilibrium had sug- 
gested that this reaction, uniquely in the photocycle, involves increase 
in bound water. However, to date there is no crystallographic structure 
available for the N state that would reveal the positions of the putative 
additional water molecules and the groups that coordinate them. 

The M to N and the N to O reactions both occur on the millisecond 
time scale, yet Asp-96 cannot be in communication with the cytoplasmic 
surface at the time its protonation equilibrium with the Schiff base is 
established. It is likely that this is ensured by the hydrophobic shield 
between Asp-96 and the aqueous interface that contains the side chains 
of Phe-42, Leu-100, Phe-171, and Leu-223. At the time Asp-96 is repro- 
tonated from the bulk in the N to O reaction, this shield must be made 
permeable, while the pathway between Asp-96 and the Schiff base is abol- 
ished. The crystal structure of the O-like state suggests how these two 
goals are accomplished (Fig. 6, see color insert; Rouhani et al., 2001). 
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I. INTRODUCTION 


Succinate:quinone oxidoreductases (EC 1.3.5.1; Hägerhäll, 1997; 
Lancaster, 2002a,b) are enzymes that couple the two-electron oxidation 
of succinate to fumarate (reaction 1) to the two-electron reduction of 
quinone to quinol (reaction 2). 


Succinate = fumarate + 2H* + 2e7 (1) 


Quinone + 2H* + 2e7 = quinol (2) 


They can also catalyze the opposite reaction, the coupling of quinol ox- 
idation to quinone to the reduction of fumarate to succinate (Lemma 
et al., 1991). The cis-configuration isomer of fumarate, maleinate, is nei- 
ther produced in the oxidation reaction nor consumed as a substrate 
in the reduction reaction, i.e, the reaction is stereospecific in both di- 
rections. Depending on the direction of the reaction catalyzed in vivo, 
the members of the superfamily of succinate:quinone oxidoreductases 
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can be classified as either succinate:quinone reductases (SQR) or 
quinol:fumarate reductases (QFR). SQR and QFR can be degraded 
to form succinate dehydrogenase and fumarate reductase (both EC 
1.3.99.1), which no longer react with quinone and quinol, respectively. 

SQR and QFR complexes are anchored in the cytoplasmic membranes 
of archaebacteria and eubacteria and in the inner mitochondrial mem- 
brane of eukaryotes with the hydrophilic domain extending into the 
cytoplasm and the mitochondrial matrix, respectively. 

SQR (respiratory complex II) is involved in aerobic metabolism as 
part of the citric acid cycle and of the aerobic respiratory chain (Saraste, 
1999). QFR participates in anaerobic respiration with fumarate as the 
terminal electron acceptor (Kröger, 1978; Kröger et al., 2002) and is part 
of the electron transport chain catalyzing the oxidation of various donor 
substrates (e.g., Ha or formate) by fumarate. These reactions are coupled 
via an electrochemical proton potential (Ap) to ADP phosphorylation 
with inorganic phosphate by ATP synthase (Mitchell, 1979). 

Succinate:quinone oxidoreductases generally contain four protein 
subunits referred to as A, B, C, and D. Subunits A and B are hydrophilic, 
whereas the subunits C and D are integral membrane proteins. Among 
species, subunits A and B have high sequence homology, while that for 
the hydrophobic subunits is much lower. Most of the SQR enzymes of 
gram-positive bacteria and the QFR enzymes from e-proteobacteria con- 
tain only one larger hydrophobic polypeptide (C), which is thought to 
have evolved from a fusion of the genes for the two smaller subunits C 
and D (Hagerhall and Hederstedt, 1996; Hederstedt, 1999; Lancaster 
et al., 1999). While subunit A harbors the site of fumarate reduction 
and succinate oxidation, the hydophobic subunit(s) contain the site of 
quinol oxidation and quinone reduction. 

Based on their hydrophobic domain and heme content (Hagerhall 
and Hederstedt, 1996; Hederstedt, 1999), succinate:quinone oxidore- 
ductases can be classified in five types (cf. Fig. 1; Lancaster and Kröger, 
2000; Lancaster, 2001a). Type A enzymes contain two hydrophobic sub- 
units and two heme groups, e.g., SQR from the archaea Archaeoglobus 
fulgidus, Natronomonas pharaonis, and Thermoplasma acidophilum. Type B 
enzymes contain one hydrophobic subunit and two heme groups, as is 
the case for SQR from the gram-positive bacteria Bacillus subtilis and 
Paenibacillus macerans and QFR from the e-proteobacteria Campylobacter 
jejuni, Helicobacter pylori, and Wolinella succinogenes. Examples for type C 
enzymes, which posses two hydrophobic subunits and one heme group, 
are SQR from mammalian mitochondria and from the proteobacteria 
Paracoccus denitrificans and Escherichia coli and QFR from the nematode 
Ascaris suum. The QFR of E. coli is an example of a type D enzyme, 
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Succinate Succinate Succinate Succinate Succinate 
Fumarate + 2H+ Fumarate + 2H+ Fumarate + 2H* Fumarate + 2H+ Fumarate + 2H*+ 














Fıc. 1. Classification (A to E) of succinate:quinone oxidoreductases (Lancaster and 
Kröger, 2000; Lancaster 2001a) based on their hydrophobic domain and heme content 
(Hägerhäll and Hederstedt, 1996; Hederstedt, 1999). The hydrophilic subunits A and B 
are drawn schematically in gray and dark gray, respectively, the hydrophobic subunits C 
and D in light gray or gray. Heme groups are symbolized by small rectangles. The direc- 
tions of the reactions catalyzed by SQR and QFR are indicated by gray and black arrows, 
respectively. White rectangles symbolize the respective cytoplasmic or inner mitochon- 
drial membrane bilayer. The positive (+) and negative (—) sides of the membrane are 
indicated. In bacteria, the negative side is the cytoplasm (“inside”), the positive side the 
periplasm (“outside”). For mitochondrial systems, these are the mitochondrial matrix and 
the intermembrane space, respectively. The type of quinone transformed in vivois not nec- 
essarily unique for each type of enzyme. The examples given are thermoplasma-quinone 
(TK), menaquinone (MK), ubiquinone (Q), and caldariella quinone (CQ; Collins and 
Jones, 1981; Hagerhall, 1997; Lancaster, 2001b). See text for further details. Modified 
from Lancaster (2001). FEBS Lett. 504, 133-141, with permission from Elsevier Science. 


which contains two hydrophobic subunits and no heme group. Finally, 
type E enzymes, such as SQRs from the archaea Acidianus ambivalens 
and Sulfolobus acidocaldarius, but also from the proteobacterium C. jejuni 
and the cyanobacterium Synechocystis, also contain no heme, but have 
two hydrophobic subunits very different from the other four types and 
more similar to those of heterodisulfide reductase from methanogenic 
archaea (Schäfer et al., 1999). The phylogenetic analyses presented re- 
cently (Lemos et al., 2002; Schafer et al., 2002) corroborate the above 
classification scheme. 

Generally, succinate:quinone oxidoreductases contain three iron- 
sulfur clusters, which are exclusively bound by the B subunit. Enzyme 
types A-D contain one [2Fe-2S]**!*, one [4Fe-45]?*!*, and one 
[3Fe-4S]!+° cluster, whereas an additional [4Fe-4S] cluster apparently 
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replaces the [3Fe-4S] in the type E enzyme (Gomes et al., 1999). The 
A subunit of all described membrane-bound succinate:quinone oxidore- 
ductase complexes contains a covalently bound FAD prosthetic group 
(Singer and McIntire, 1984). The chemical structure of the linkage as 8a- 
[Ne-histidyl]-FAD was first established for mammalian SQR (Walker and 
Singer, 1970) and subsequently for the QFR enzymes of W. succinogenes 
(Kenny and Kröger, 1977) and E. coli (Weiner and Dickie, 1979). 


II. OVERALL DESCRIPTION OF THE STRUCTURE 


The currently available crystal structures of succinate:quinone oxi- 
doreductases are those of two prokaryotic quinol:fumarate reductases, 
both since 1999. The E. coli QFR, determined at 3.3 A (Iverson et ol. 
1999), belongs to the type D enzymes, and the QFR of W. succinogenes, 
refined at 2.2 A resolution (Lancaster et al., 1999), is of type B. Three 
structures of the latter enzyme, based on three different crystal forms, 
are available. The first two, PDB entries IQLA and 1QLB (Lancaster 
et al., 1999), are considerably better defined and more accurate than 
the structure of the third crystal form, PDB entry 1E7P (Lancaster et al., 
2000, 2001). Therefore, the first two crystal forms of W. succinogenes QFR 
will be used for the description of structural features, and that of the 
third crystal form will be referred to for comparison. 

In all three W. succinogenes QFR crystal forms, two heterotrimeric com- 
plexes of A, B, and C subunits are associated in an identical fashion, thus 
forming a dimer (Fig. 2, see color insert). Wolinella succinogenes QFR has 
an overall length of 120 A in the direction perpendicular to the mem- 
brane. Parallel to the membrane, the maximum width is 130 A for the 
dimer, and 70 A for the monomer. Approximately 3665 A? (8%) of the 
W. succinogenes QFR monomer surface is buried upon dimer formation. 
As derived from analytical gel filtration experiments, this dimer is ap- 
parently also present in the detergent-solubilized state of the enzyme 
(Lancaster and Kröger, 2000; Unden et al., 1980), implying that it is 
unlikely to be an artifact of crystallization. 


HI. THE HYDROPHILIC SUBUNITS 


A. Subunit A, the Flavoprotein, and Interdomain Movement at the Site of 
Fumarate Reduction 
Wolinella succinogenes QFR subunit A, of 73 kDa (Lauterbach et al., 


1990), is composed of four domains (Fig. 3a, see color insert), the bipar- 
tite FAD binding domain (blue, residues Al-260 and A366-436, with “A” 
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indicating the A subunit), into which the capping domain (green, A260- 
366) isinserted, the helical domain (red, A436-554), and the C-terminal 
domain (A554-656, not shown in Fig. 3a.). The FAD is covalently bound 
as 8a-[Ne-histidyl]-FAD (Kenny and Kröger, 1977) to the residue His 
A43. The capping domain contributes to burying the otherwise solvent- 
exposed FAD isoalloxazine ring from the protein surface. 

A W. succinogenes QFR crystal grown in the presence of fumarate was 
found to be of crystal form “B.” The structure was refined at 2.33 A 
resolution (PDB entry 1QLB; Lancaster et al., 1999). This allowed the 
localization of the fumarate binding site between the FAD binding do- 
main and the capping domain next to the plane of the FAD isoalloxazine 
ring (Fig. 3b). The structure of the enzyme in the third crystal form, “C” 
(Lancaster et al., 2000), was refined at 3.1 A resolution (PDB entry 1E7P; 
Lancaster et al., 2001). Compared with the previous crystal forms, the al- 
tered crystal packing (Lancaster, 2002c,d) results in the capping domain 
being in a different arrangement relative to the FAD-binding domain 
(Fig. 3a). This leads to interdomain closure at the fumarate reducing site, 
suggesting that the structure encountered in this crystal form represents 
a closer approximation to the catalytically competent state of the enzyme 
(Fig. 3b). The trans hydrogenation of fumarate to succinate could occur 
by the combination of the transfer of a hydride ion and of a proton from 
opposite sides of the fumarate molecule. One of the fumarate methenyl 
carbon atoms could be reduced by direct hydride transfer from the N-5 
position of the reduced FADH,, while the other fumarate methenyl car- 
bon is protonated by the side chain of Arg A301 (Figs. 3c and 3d). The 
latter residue replaces the water molecule previously suggested to be the 
proton donor (Lancaster et al., 1999) based on the structure in crystal 
form B (Fig. 3b). The assignment as to which of the fumarate methenyl 
carbon atoms accepts the hydride and which the proton is currently 
ambiguous (Fig. 3c versus 3d), because data of sufficient completeness 
and quality for this crystal form have so far only been obtained for the 
complex with malonate and not yet in the presence of fumarate. Re- 
lease of the product could be facilitated by movement of the capping 
domain away from the dicarboxylate site (Lancaster et al., 1999, 2001). 
All residues implicated in substrate binding and catalysis are conserved 
throughout the superfamily of succinate:quinone oxidoreductases, so 
that this reversible mechanism is considered generally relevant for all 
succinate:quinone oxidoreductases. 

This mechanistic interpretation of the structure is supported by the 
results from site-directed mutagenesis, where Arg A301 was replaced rel- 
atively conservatively by a Lys (Lancaster et al., 2001). Strain FrdA-R301K 
contained a variant enzyme, very similar to the wild-type enzyme in terms 
of cofactor and subunit composition, in particular a fluorescence typical 
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for FAD covalently bound to the A subunit, but which lacked succinate 
dehydrogenase and fumarate reductase activity (Lancaster et al., 2001). 
The loss in enzymatic activity is tentatively attributed to the fact that Lys 
(PKsoı = 10.8) cannot substitute for Arg (p Kso = 12.5) in protonating 
the fumarate methenyl carbon, possibly because the protonating group 
is no longer close enough to protonate the fumarate methenyl group. 


B. Subunit B, the Iron-Sulfur Protein 


The Ca trace of W. succinogenes subunit B is shown in Fig. 4. This 
subunit of 27 kDa (Lauterbach et al., 1990) consists of two domains, 
an N-terminal “plant ferredoxin” domain (B1-106) binding the [2Fe- 
2S] iron-sulfur cluster and a C-terminal “bacterial ferredoxin” domain 
(B106-239) binding the [4Fe-4S] and the [3Fe-4S] iron-sulfur clusters. 
The [2Fe-2S] iron-sulfur cluster is coordinated by the Cys residues B57, 
B62, B65, and B77 as proposed on the basis of sequence alignments 
(Lauterbach et al., 1990). All four Cys residues are within segments that 
are in contact with the A subunit. The [4Fe-4S] iron-sulfur cluster is 
ligated to the protein through Cys residues B151, B154, B157, and B218, 


[2Fe-2S] 





[3Fe-4S] 








Fic. 4. Subunit B, the iron-sulfur protein of W. succinogenes QFR. The Ca trace is 
color-coded from light gray to dark gray for the amino-terminal [2Fe-2S] domain on the 
right (residues Bl-106) and again from light gray to dark gray for the carboxy-terminal 
[7Fe-8S] domain on the left (B106-B239). Modified from Lancaster et al. (1999). Nature 
402, 377-385, with permission from Nature Publishing Group. 
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and the [3Fe-4S] cluster is coordinated by Cys residues B161, B208, and 
B214. The latter three residues are within segments that are in contact 
with the C subunit. At the position corresponding to the fourth Cys of 
the [4Fe-4S] cluster, the W. succinogenes QFR [3Fe-4S] cluster contains a 
Leu. Whereas the introduction of a Cys into E. coli QFR (Mandori et al., 
1992) could replace the native [3Fe-45] by a [4Fe-4S] cluster, this was 
not the case for B. subtilis SQR (Hagerhall et al., 1995). 


IV. SUBUNIT C, THE INTEGRAL MEMBRANE DIHEME CYTOCHROME b 


The Ca trace of W. succinogenes subunit C is shown in Fig. 5a. This sub- 
unit of 30 kDa (Körtner et al., 1990) contains five membrane-spanning 
segments with preferentially helical secondary structure. For systematic 
reasons within the superfamily of succinate:quinone oxidoreductases, 
these segments are labeled [according to Hägerhäll and Hederstedt 
(1996)] I (C22-52), II (C77-100), IV (C121-149), V (C169-194), and 
VI (0202-237). To a varying degree, all five transmembrane segments 
are tilted with respect to the membrane normal, and helix IV is strongly 
kinked at position C137 (Lancaster et al., 1999). This kink is stabilized by 
the side-chain y-hydroxyl of Ser C141, which, instead ofits backbone NH, 
donates a hydrogen bond to the carbonyl oxygen of Phe C137 (Fig. 5b). 
As pointed out earlier (Lancaster and Kroger, 2000), this feature is very 
similar to that found for helix F of bacteriorhodopsin [bR, PDB entry 
1C3W (Luecke et al., 1999), Figs. 5c and 5d]. 

The planes of both heme molecules bound by the W. succinogenes en- 
zyme are approximately perpendicular to the membrane surface and 
their interplanar angle is 95° (Lancaster et al., 1999). The axial ligands 
to the “proximal” heme dp are His C93 of transmembrane segment II and 
His C182 of transmembrane segment V (Fig. 6a). This causes heme bp to 
be located toward the cytoplasmic surface of the membrane, and thus to- 
ward the [3Fe-4S] iron-sulfur cluster. Hydrogen bonds and salt bridges 
with the propionate groups of heme bp are formed with the side chains 
of residues Gln C30, Ser C31, Trp C126, and Lys C193 (Fig. 6a). Thus, 
side chains from the residues of the first four transmembrane segments 
are involved in the binding of heme bp (Lancaster et al., 1999), which 
underscores the structural importance of the bound heme (Simon et al., 
1998). The axial ligands to the “distal” heme bp are His C44 of transmem- 
brane segment I and His C143 of transmembrane segment IV (Fig. 6b), 
demonstrating that all four heme axial ligands had been correctly pre- 
dicted by sequence alignment (Körtner et al., 1990) and site-directed 
mutagenesis (Simon et al., 1998). Residues of W. succinogenes QFR sub- 
unit C conserved among the succinate:quinone oxidoreductases from 


138 C. ROYD. LANCASTER 





Fic.5. Wolinella succinogenesQFR subunit C. (a) Subunit C, the diheme cytochrome b of 
W. succinogenes QFR (stereo views). Also shown are the proximal (upper) and distal (lower) 
heme groups and the position of the [3Fe-4S] cluster (top), which is bound by the B sub- 
unit (Fig. 4). Selected C subunit residues are labeled by their residue number. The Ca 
trace is drawn from light gray (amino terminus) to dark gray (carboxy terminus). (Modi- 
fied from Lancaster and Simon, 2002, Biochim. Biophys. Acta Bioenerget. 1553, 84-101, with 
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e-proteobacteria (Lancaster and Simon, 2002) are concentrated around 
the heme groups and the contact surface with subunit B (Fig. 6a). 
However, a distal “rim” of conserved residues is also apparent involving 
residues Glu C66, Ile C154, Ser C159, and Arg C162 (Fig. 6b). While the 
latter two residues interact with a propionate of heme bp, Glu C66 and 
Ile C154 are likely to line the oxidation site of the menaquinol substrate, 
as discussed below. 


V. GENERAL COMPARISON OF MEMBRANE-INTEGRAL DIHEME 
CYTOCHROME b PROTEINS 


As noted earlier (Lancaster et al., 1999), the binding of the two heme b 
molecules by an integral membrane protein four-helix bundle described 
here is very different from that described for the four-helix bundle of the 
cytochrome bc; complex (Xia etal., 1997). In the latter complex, only two 
transmembrane segments provide two axial heme b ligands each. This we 
refer to as a “two-helix motif” (Lancaster, 2002e; Fig. 7a). Examples for a 
“three-helix motif,” where one transmembrane helix provides two heme 
b ligands, and two others provide one heme b ligand each (Fig. 7b), 
may be found in the cases of membrane-bound hydrogenases (Berks 
et al., 1995; Gross et al., 1998), and formate dehydrogenases (Berks et al., 
1995; Jormakka et al., 2002). As described above, the axial ligands for 
heme binding in W. succinogenes QFR are located on four different trans- 
membrane segments (“four-helix motif”; Fig. 7c). One consequence of 
this difference is that the distance between the two heme iron centers is 
distinctly shorter in W. succinogenes QFR (15.6 A) than it is in the mito- 
chondrial cytochrome bc; complex (21 A; Xia et al., 1997) and in E. coli 
formate dehydrogenase-N (20.5 A; Jormakka et al., 2002). 





permission from Elsevier Science.) (b)-(d) Comparison of (b) transmembrane helix IV 
from W. succinogenes QFR (PDB entry 1QLA; Lancaster et. al., 1999) and (c) transmem- 
brane helix F from Halobacterium salinarum bacteriorhodopsin (bR) (PDB entry 1C3W; 
Luecke et al., 1999). In both panels hydrogen bonding interactions are indicated by 
black dashed lines. Highlighted hydrogen bonds donated from the Ser side chain are 
indicated by black solid lines. The corresponding interaction donated by the backbone 
NH for a standard a-helix is indicated as a gray dashed line. (b) The distance between Ser 
C141 N and Phe C137 O (indicated in gray dashes) is 4.0 A; the distance between Ser C141 
Oy and Phe C137 O (indicated in black) is 2.8 A. (c) The distance between Ser 183 N 
and Val 179 O (gray) is 3.0 A; the distance between Ser 183 Oy and Val 179 O (black) is 
2.6 A. (d) Superposition of QFR transmembrane helix IV (dark gray, from panel b), bR 
transmembrane helix F (light gray, from panel c), and an idealized a-helix (black). Mod- 
ified from Lancaster and Kröger (2000). Biochim. Biophys. Acta Bioenerget. 1459, 422-431, 
with permission from Elsevier Science. 





Fic. 6. Proximal (a) and distal (b) residues in the structure of W. succinogenes QFR 
that are conserved in the QFR enzymes from other e-proteobacteria (stereo views). The 
Ca trace of subunit C is also shown. (a) In addition of the Ca trace of subunit C, that 
of subunit B is shown at the top. Also indicated are two conserved B subunit residues 
(B209 and B216). All other labeling refers to selected proximal C subunit residues. The 
following prosthetic groups are included from the top left to the bottom right: the [4Fe- 
4S] cluster, the [3Fe-4S] cluster, the proximal heme, and the distal heme. (b) Selected 
distal C subunit residues are labeled. The proximal (upper) and distal (lower) heme are 
included as is a tentative menaquinol binding position. Transmembrane helices II and 
VI have been omitted for clarity. Modified from Lancaster and Simon (2002). Biochim. 
Biophys. Acta Bioenerget. 1553, 84-101, with permission from Elsevier Science. 
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Fic. 7. Diheme binding by integral membrane protein four helix bundles (Lancaster, 
2002e). (a) “Two-helix motif”: His ligands from two transmembrane helices (mitochon- 
drial cytochrome bc; complex). (b) “Three-helix motif”: His ligands from three trans- 
membrane helices (e.g., hydrogenase, formate dehydrogenase). (c) “Four-helix motif”: 
His ligands from four transmembrane helices (dihemeic succinate:quinone oxidoreduc- 
tases). Modified from Lancaster (2002). Biochim. Biophys. Acta Biomembr. 1565, 215-231, 
with permission from Elsevier Science. 


VI. RELATIVE ORIENTATION OF SOLUBLE AND MEMBRANE-EMBEDDED 
QFR SUBUNITS 


The structure of E. coli QFR can be superimposed on the structure 
of W. succinogenes QFR based on the hydrophilic subunits A and B 
(Figs. 3c and 6b in Lancaster et al., 1999). This similarity in structure 
was expected based on sequence comparisons. However, in this su- 
perimposition, the membrane-embedded subunits cannot be aligned. 
In an alternate superimposition, the transmembrane subunits C and 
D of the E. coli enzyme can be overlayed on to the W. succinogenes 
C subunit (Figs. 3d and 6c in Lancaster et al., 1999). Compared to 
the former superimposition, the latter involves a rotation around the 
membrane normal of approximately 180° and an orthogonal 25° rota- 
tion. This immediately leads to two important conclusions (Lancaster 
et al., 1999). First, the structures of the transmembrane subunits car- 
rying no hemes and two hemes, respectively, can be aligned to a sig- 
nificant degree, although only 11 of the aligned residues are identical. 
Second, the relative orientation of the soluble subunits and the trans- 
membrane subunits is different in the QFR complexes from the two 
species. 


VII. THE SITE OF MENAQUINOL OXIDATION/ MENAQUINONE REDUCTION 


The site of menaquinol oxidation on the diheme cytochrome b sub- 
unit of W. succinogenes QFR is not known. No convincing density for 
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a quinol or quinone could be found in any of the three crystal forms 
of the oxidized enzyme. No specific inhibitor of menaquinol oxidation 
by W. succinogenes QFR has been identified. The E coli QFR coordinate 
set IFUM [Iverson et al., 1999; now replaced by coordinate set 1LOV 
(Iverson et al., 2002) ] contains models for two menaquinone molecules 
per ABCD monomer. Although some of the atomic temperature factors 
of the quinone ring atoms were larger than 100 Å?, indicating that these 
quinone models may not be well defined, these models were included in 
Figs. 3d and 6c in Lancaster et al. (1999) for comparison. This structural 
alignment showed that the E coli QFR menaquinone models are at posi- 
tions occupied by heme propionates in W. succinogenes QFR (Lancaster 
et al., 1999). Consequently, no conclusions regarding quinone binding 
in W. succinogenes QFR could be drawn from the menaquinone positions 
in E colt QFR. 

In the W. succingenes QFR crystal structure, a cavity which extends from 
the hydrophobic phase of the membrane, close to the distal heme bp, 
to the periplasmic aqueous phase could accommodate a menaquinol 
molecule, after minor structural alterations (Lancaster et al., 2000), 
which are consistent with experimentally observed structural differences 
for the presence and absence of a quinone substrate (Lancaster and 
Michel, 1997). A glutamate residue (Glu C66) lines the cavity and could 
be involved in the acceptance of the protons liberated upon oxida- 
tion of the menaquinol (Fig. 6b). Replacement of Glu C66 by a glu- 
tamine residue resulted in a mutant which did not catalyze quinol oxi- 
dation by fumarate, whereas the activity of fumarate reduction was not 
affected by the mutation (Lancaster et al., 2000). X-Ray crystal struc- 
ture analysis of the Glu C66 — Gln variant enzyme ruled out signif- 
icant structural alterations. The midpoint potentials of the two heme 
groups of subunit C were not significantly affected. These results indi- 
cate that the inhibition of quinol oxidation activity in the mutant enzyme 
is due to absence of the carboxyl group of Glu C66. Thus it was con- 
cluded that Glu C66, which is conserved in the QFR enzymes from the 
€-proteobacteria C. jejuni and H pylori, is an essential constituent of the 
menaquinol oxidation site (Lancaster et al., 2000) close to heme bp 
(Fig. 6b). 


VIII. ELECTRON AND PROTON TRANSFER AND THE 
Wolinella succinogenes PARADOX 


For the function of QFR, electrons have to be transferred from the 
quinol-oxidizing site in the membrane to the fumarate-reducing site, 
protruding into the cytoplasm. The arrangement of the prosthetic 
groups in the QFR dimer is displayed in Fig. 8a together with the 
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Fıc. 8. The coupling of electron and proton flow in W. succinogenes QFR. Positive 
and negative sides of the membrane are as described for Fig. 1. (a) Hypothetical trans- 
membrane electrochemical potential as suggested by the essential role of Glu C66 for 
menaquinol oxidation by W. succinogenes QFR (Lancaster et al., 2000). The prosthetic 
groups of the W. succinogenes QFR dimer are displayed (coordinate set 1QLA; Lancaster 
et al., 1999). Distances between prosthetic groups are edge-to-edge distances in angstroms 
as defined by Page et al. (1999). Also indicated are the side chain of Glu C66 and a ten- 
tative model of menaquinol (MKHs) binding. The position of bound fumarate (Fum) is 
taken from PDB entry 1QLB (Lancaster et al., 1999). (b) Hypothetical cotransfer of one 
Ht per electron across the membrane (“E-pathway hypothesis”). The two protons liber- 
ated on oxidation of menaquinol (MKHg) are released to the periplasm (bottom) via the 
residue Glu C66. In compensation, coupled to electron transfer via the two heme groups, 
protons are transferred from the periplasm via the ring C propionate of the distal heme 
bp and the residue Glu C180 to the cytoplasm (top), where they replace those protons 
which are bound during fumarate reduction. In the oxidized state of the enzyme, the 
“E-pathway” is blocked. Modified from Lancaster (2002). Biochim. Biophys. Acta Biomembr. 
1565, 215-231, with permission from Elsevier Science. 


edge-to-edge distances relevant for electron transfer as defined by Page 
et al. (1999). It has been shown for other electron transfer proteins 
that physiological electron transfer occurs if such distances are shorter 
than 14 A, but not if they are longer than 14 A (Page et al., 1999). 
In the case of W. succinogenes QFR, this indicates that physiological 
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electron transfer can occur between the six prosthetic groups of one 
OFR heterotrimeric complex, but not between the two QFR complexes 
in the dimer (Lancaster et al., 2000). 

The fumarate molecule is in van der Waals contact with the isoallox- 
azine ring of FAD. The linear arrangement of the prosthetic groups in 
one QFR complex therefore provides one straightforward pathway by 
which electrons could be transferred efficiently from the menaquinol 
oxidizing site via the two heme groups, the three iron-sulfur clusters, 
and the FAD to the site of fumarate reduction. 

The two heme groups have different oxidation-reduction potentials 
(Kröger and Innerhofer, 1976): one is the “high-potential” heme by, 
the other the low-potential “low-potential” heme bz. For the membrane- 
bound enzyme, these potentials are Ey = —20 mV and Ey = —200 mV, 
respectively (Kröger and Innerhofer, 1976). For the detergent- 
solubilized QFR enzyme, the respective values are —15 mV and —150 mV 
(Lancaster et al., 2000). It has notyet been established which of the hemes 
bp and bp corresponds to bz and by in W. succinogenes QFR. 

Because of its very low midpoint potential (Em < —250 mV; Unden 
et al., 1984), the [4Fe-4S] iron-sulfur cluster has a very low potential and 
has been suggested not to participate in electron transfer [see Hagerhall 
(1997) for a discussion]. However, the determined low potential may be 
an artifact due to anti-cooperative electrostatic interactions between the 
redox centers (Salerno, 1991). The position of the [4Fe-4S] cluster as 
revealed in the structures of W. succinogenes QFR and E. coli QFR is highly 
suggestive of its direct role in electron transfer from the [3Fe-48] cluster 
to the [2Fe-2S] cluster. Despite this major thermodynamically unfavor- 
able step, the calculated rate of electron transfer is on a microsecond 
scale, demonstrating that this barrier can easily be overcome by thermal 
activation as long as the electron transfer chain components are suffi- 
ciently close to promote intrinsically rapid electron tunneling (Dutton 
et al., 1998). 

In addition to the transfer of electrons, two protons are bound on 
fumarate reduction (see reaction 1) and two protons are liberated 
on menaquinol oxidation (see reaction 2). The protons consumed 
on fumarate reduction are undoubtably bound from the cytoplasm 
(see Fig. 8a). The experimental results on intact bacteria, with inverted 
vesicles or liposomes containing W. succinogenes QFR (Kröger et al., 2002; 
Biel et al., 2002), suggest that the oxidation of menaquinol by fumarate 
as catalyzed by W. succinogenes QFR is an electroneutral process. The 
protons formed by menaquinol oxidation have therefore been assumed 
to be released to the cytoplasmic side of the membrane where they bal- 
ance the protons consumed by fumarate reduction. 
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However, the essential role of Glu C66 for menaquinol oxida- 
tion demonstrated by Lancaster et al. (2000) contrasts this interpre- 
tation. There is no conceivable proton transfer pathway from the 
inferred menaquinol oxidation site to the cytoplasmic phase. This 
strongly suggests that the protons liberated during menaquinol oxida- 
tion are released on the periplasmic side of the membrane. In sum- 
mary, the location of the catalytic sites of fumarate reduction and 
menaquinol oxidation in the structure suggests that quinol oxidation 
by fumarate should be an electrogenic process in W. succinogenes, in con- 
trast to the results of experimental measurements for W. succinogenes 
OFR. 

Succinate oxidation by menaquinone, an endergonic reaction under 
standard conditions, is catalyzed by a B-type succinate:quinone oxidore- 
ductase in gram-positive bacteria, e.g., B. subtilis. There is experimen- 
tal evidence indicating that succinate oxidation by menaquinone in 
B. subtilis is driven by the electrochemical proton potential (Schirawski 
and Unden, 1998). This is the analogous reaction to that suggested for 
W. succinogenes QFR (Fig. 8a), but in the opposite direction (Ohnishi 
et al., 2000; Lancaster and Kroger, 2000). In B. subtilis SOR, the site of 
menaquinone reduction, has been proposed to be located close to heme 
bp (Matsson et al., 2000). Recent experimental results indeed indicate 
that B. subtilis SQR generates a proton potential when functioning as a 
OFR (Schnorpfeil et al., 2001). Consequently, there are discrepancies, 
first between the experimental results for B. subtilis SQR (Schirawski 
and Unden, 1998; Schnorpfeil et al., 2001) and W. succinogenes QFR 
(Geisler et ol. 1994; Kröger et al., 2002; Biel et al., 2002) and second 
also between the latter and the implications of the essential role of Glu 
C66 in W. succinogenes QFR. We refer to the latter discrepancy as the 
“W. succinogenes paradox.” 


IX. THE “E-PATHWAY HYPOTHESIS” OF COUPLED TRANSMEMBRANE 
ELECTRON AND PROTON TRANSFER 


Previously, there was no satisfactory explanation for the above ap- 
parent discrepancies. A working hypothesis has been proposed to ex- 
plain them satisfactorily and to resolve the “W. succinogenes paradox” 
(Lancaster, 2002e). According to this hypothesis, the quinol oxidation 
process in W. succinogenes QFR is coupled to the compensating, parallel 
transfer of protons from the periplasm to the cytoplasm. The pathway 
of this proton transfer is transiently established during reduction of the 
heme groups of the enzyme and is not obvious from the available crys- 
tal structure of the oxidized enzyme. Possible constituents of such a 
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proton transfer pathway are the ring C propionate of the distal heme 
and the transmembrane helix V residue Glu C180 (the “E-pathway,” 
Fig. 8b). The latter residue is conserved in the QFR enzymes from 
H. pylori and C. jejuni, but not in B. subtilis SQR. This “E-pathway 
hypothesis” explains why the net reaction catalyzed by W. succinogenes 
QFR does not contribute directly to the generation of a transmem- 
brane electrochemical potential. Such a process is consistent with both 
a distal quinol oxidation site and the observed apparent electroneutral- 
ity of the net reaction. An electroneutral reaction is more consistent 
with the small oxidation-reduction potential difference between the 
fumarate/succinate couple (Em = +25 mV; Ohnishi et al., 2000) and 
the menaquinone/menaquinol couple (Ey = —74 mV; Thauer et al., 
1977). Initial experiments providing support for the E-pathway hypoth- 
esis have been performed (Lancaster, C. R. D., Sauer, U. S., Gross, R., 
Haas. A., Mäntele, W., and Simon, J., manuscript in preparation; Haas, A., 
Sauer, U. S., Gross, R., Simon, J., Mantele, W., and Lancaster, C. R. D., 
manuscript in preparation). 


X. CONCLUDING REMARKS 


An overview of the current status of discussion of electron and proton 
transfer in succinate:quinone oxidoreductases is shown in Figs. 9a—9d. In 
mitochondrial complex II and other C-type enzymes, such as SQR from 
P. denitrificans and E. coli, electron transfer from succinate to ubiquinone 
does not lead to the generation of a transmembrane electrochemical 
potential Ap, since the protons released by succinate oxidation are on 
the same side of the membrane as those consumed by quinone reduc- 
tion [Fig. 9a; see Ohnishi et al. (2000) for a review]. As noted earlier 
(Lancaster et al., 1999), itis unlikely that transmembrane electron trans- 
fer occurs in the E coli QFR, because of the large edge-to-edge distance of 
~25 A between the two quinone models. Therefore, it is most likely that 
quinol oxidation occurs at a proximal site (Fig. 9b). This is supported 
by crystallographic studies of inhibitor binding to E. coli QFR (Iverson 
et al., 2002). The indications that the endergonic oxidation of succinate 
by menaquinone as catalyzed by B-type SQRs in B. subtilis and other 
gram-positive bacteria is driven by Ap (Fig. 9c) have been summarized 
in Section VIII. In the opposite direction, the reduction of fumarate by 
menaquinol does not seem to provide enough energy to allow for the 
generation of Ap. In this case, enzymes with a compensatory “E-pathway” 
are apparently preferred (Fig. 9d). 
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Fic. 9. The coupling of electron and proton flow in succinate:quinone oxidoreduc- 
tases in aerobic (a,c) and anaerobic respiration (b,d), respectively. Positive and negative 
sides of the membrane are as described for Fig. 1. (a) and (b) Electroneutral reactions 
as catalyzed by C-type SQR enzymes (a) and D-type E. coli QFR (b). (c) Utilization of a 
transmembrane electrochemical potential Ap as possibly catalyzed by A-type and B-type 
enzymes. (d) Electroneutral fumarate reduction by B-type QFR enzymes with a proposed 
compensatory “E-pathway.” 
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I. INTRODUCTION 


The mitochondria of eukaryotic cells are the organelles that convert 
energy to forms which can be used in the cell. In the inner membrane of 
the mitochondria, four respiratory enzymes perform a series of redox re- 
actions by transferring electrons through electron carriers to Og, the ter- 
minal electron acceptor (Fig. 1A). NADH dehydrogenase (Complex I) 
and succinate dehydrogenase (Complex II) reduce membrane-bound 
ubiquinone using NADH and succinate, respectively, which are pro- 
vided by the TCA cycle in the matrix space. The reduced ubiquinone 
(ubiquinol) can freely diffuse within the membrane to other com- 
plexes. Using this ubiquinol, the cytochrome bc; complex (ubiquinol- 
cytochrome c oxidoreductase or Complex III) reduces cytochrome c, the 
sole soluble protein component in the respiratory chain. The reduced 
cytochrome c is then utilized by cytochrome c oxidase (Complex IV) 
to reduce oxygen molecules to water molecules. The free energy re- 
leased by electron transfer is captured as chemical energy in the form 
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Fic. 1. A schematic illustration of the mitochondrial (A) and Escherichia coli (B) respi- 
ratory chains. Respiratory enzymes perform a series of oxidation-reduction reactions by 
transferring electrons (dashed lines) through mobile electron carriers. Electron transfer 
is coupled to the pumping of protons (thick black arrows) from the N-side (negative 
side) to the P-side (positive side) generating a proton gradient that ultimately drives the 
conversion of ADP to ATP. 


of a proton gradient that can be used to drive the conversion of ADP to 
ATP. 

Many prokaryotic organisms such as Escherichia coli have a simplified 
respiratory chain located in the inner cell membrane (Fig. 1B). The 
E. coli respiratory chain performs a function similar to as its mitochon- 
drial counterpart but lacks Complex III. Instead, electrons are directly 
transferred from the ubiquinol molecule to Complex IV (ubiquinol 
oxidase in this case). 

In both systems, membrane-bound ubiquinone plays crucial roles in 
the respiratory chain. Indeed, various quinones, including ubiquinone 
and menaquinone, are used to connect the redox reactions of various 
membrane proteins. In spite of the large amount of biochemical and bio- 
physical data on quinone and quinone binding proteins, little structural 


STRUCTURE AND FUNCTION OF QUINONE BINDING MEMBRANE PROTEINS 153 


information on the precise action mechanism of quinone in the res- 
piratory chain is available. So far the structures of the following pro- 
teins have provided structural information on quinone binding sites: 
the photosynthetic reaction center (Deisenhofer et al., 1985; Allen et al., 
1986), cytochrome bc; complex (Xia et al., 1987; Zhang et al., 1998; 
Iwata et al., 1998; Hunte et al., 2000), fumarate reductase (Iverson 
et al., 1999; Lancaster et al., 1999), ubiquinol oxidase (Abramson et al., 
2000), photosystem I (Jordan et al., 2002), and formate dehydrogenase 
(Jormakka et al., 2002). In this review, we will discuss the structure and 
function of the cytochrome bc; complex from bovine mitochondria and 
ubiquinol oxidase from E. coli focusing on the quinone binding sites 
within these proteins. 


IJ. STRUCTURE OF CYTOCHROME bc; COMPLEX FROM BOVINE 
HEART MITOCHONDRIA 


A. Cytochrome bc, Complex 


The cytochrome bc; complex was isolated in the early 1960s (for re- 
view see Clejan and Beattie, 1986) but its redox centers had been already 
identified by Keilin in 1925 and Rieske in 1964 (for review see Edwards 
and Trumpower, 1986). The cytochrome bc, complex not only is part 
of the oxygen respiratory chain but is also involved in bacterial pho- 
tosynthesis. Additionally, a related enzyme, the cytochrome be f com- 
plex, takes part in photosynthesis in chloroplasts. As mentioned above, 
the cytochrome bc; complex is an intermediate component of the oxy- 
gen respiratory chain which catalyzes the reduction of cytochrome c 
using membrane-bound ubiquinol. During the reaction, two protons 
are translocated across the membrane by a mechanism called “the pro- 
ton motive Qcycle,” discussed in the next section. Although mammalian 
cytochrome bc; complexes have 11 subunits, the number of the subunits 
varies between species from 3 to 11. The cytochrome bc, complex from 
bovine mitochondria exists as a dimer of the 11 subunits with a total 
molecular weight of about 480,000 (Schagger et al., 1986, 1995). The 
subunit composition of the bc; complex is shown in Fig. 2. 

All known cytochrome bc; complexes have three essential subunits, 
which contain all four redox centers in common. These subunits are cy- 
tochrome b (cyt. b), cytochrome cı (cyt. c1), and the Rieske [2Fe-2S] pro- 
tein (ISP). The redox centers are heme br, and heme by in cytochrome b 
(L and H represent low and high potential, respectively), heme cı in 
cytochrome cı, and Rieske [2Fe-2S] (FeS) cluster in ISP. 
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Fic. 2. Subunit composition of the bovine bc; complex. 


B. Redox-Active Subunits and the Proton Motive Ocycle 


The reaction mechanism of cytochrome bc, complex is known as “the 
proton motive Qcycle” originally proposed by Peter Mitchell (Mitchell, 
1976). This mechanism is the basis of his chemiosmotic theory for which 
he was awarded the Nobel prize in 1978. Since then, the enzyme has been 
characterized extensively using various techniques. Redox centers have 
been characterized spectroscopically (for review, see Trumpower and 
Gennis, 1994), electron transfer pathways have been determined using 
kinetic experiments with specific inhibitors (De Vries 1986; Zhu et al., 
1984), and the positions of quinone binding sites and redox centers have 
been determined using biochemical and mutational analysis (for review, 
see Esposti et al., 1993; Brasseur et al., 1996). As a result of these efforts, 
the latest modified Qcycle has been widely accepted by researchers in 
the field (for reviews, see Crofts et al., 1983; Trumpower, 1990; Berry 
et al., 2000). 

Figure 3 summarizes the subunits and redox centers involved in the 
Ocycle. As explained before only three subunits of the bc; complex, 
namely cytochrome b, cytochrome cı, and ISP, are essential for the 
Qcycle. Cytochrome b has heme b molecules, heme by and heme An 
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Fic. 3. Schematic view of the proton motive Qcycle. 





associated with two quinone-binding sites Qp and On, respectively. 
P and N represent the positive and negative sides of the membrane. 
Qp and Qx sites are also known as Q, and Q; sites, where o and i repre- 
sent proton output and input. The Qp site is the ubiquinol oxidation site 
on the “out” side of the membrane and the Qx site is the ubiquinone 
reduction site on the “in” side of the membrane. The Rieske FeS protein 
has the Rieske-type [2Fe-2S] iron sulfur cluster (FeS cluster), whose li- 
gand is two histidine (His-141 and His-161) and two cysteine (Cys-138 and 
Cys-158) side chains instead of four cysteine side chains for ferredoxin 
type [2Fe-2S] cluster. Cytochrome cı has heme cı, which transfers the 
electron to soluble cytochrome c. 

The Qcycle can be summarized starting from Or site as follows. 
Membrane-bound ubiquinol binds to the Qp site. During oxidation of 
ubiquinol the release of two protons to the mitochondrial intermem- 
brane space is coupled to electron transfer. A ubiquinol molecule can 
donate two electrons when fully oxidized. The first electron is taken 
up by FeS cluster of ISP, then passed to heme cı of cytochrome cı, 
where it is subsequently transferred to soluble cytochrome c. Interest- 
ingly, the second electron is transferred to bound ubiquinone in the 
Qn site through heme by, and by yielding a ubisemiquinone. At the 
Qn site this ubisemiquinone, which is usually unstable, is stabilized by 
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the protein environment. At the Qp site another ubiquinol molecule re- 
places the oxidized ubiquinone and the oxidation reaction is repeated. 
During the second reaction, the second electron is transferred to the 
ubisemiquinone in the Qy site yielding fully reduced ubiquinol. For the 
full reduction of ubiquinone, two protons are taken up from the ma- 
trix and the (re-) generated ubiquinol is released into the membrane. 
The released ubiquinol flips to the other side of the membrane and is, 
subsequently, used as a substrate in the Qp site. Thus ubiquinone itself 
works as a proton carrier in the Qcycle and there are no proton path- 
ways in the protein. In contrast, terminal oxidases, which use a “proton 
pump” for proton translocation, have specific proton pathways within 
the molecule. 

The key step of the Qcycle is the “electron bifurcation” at the Qp site. 
The FeS cluster has a far higher redox potential (+300 mV) than that 
of heme br (—90 mV). Considering the redox potential, both electrons 
could be transferred to the FeS cluster, although this never occurs in 
reality. The first electron is always transferred to the FeS cluster and the 
second electron to heme b. There was no clear answer how electrons 
could be bifurcated against the redox potential until the structures of 
the bc; complex were revealed. 

In the Qcycle mechanism, two different types of quinone binding sites 
play critical roles. We will analyze different types of the quinone-binding 
sites in Section ULC 


C. Overall Structure 


Crystals of the whole cytochrome bc; complex have been reported 
by different groups since 1991. Within them, Yu and Deisenhofer’s 
group succeeded in providing the first structure of the complex in 1997 
(Xia et al., 1997). They have also shown the structures of various in- 
hibitor complexes and revealed the positions of the quinone binding 
sites. Because of disorder, some of the subunits and domains, includ- 
ing the extrinsic domains of ISP and cytochrome cı, were missing in 
the structure (a more complete structure was reported later (Kim et al., 
1998)). The structure was followed by the chicken bc; complex struc- 
ture (Zhang et al., 1998), which showed all 10 subunits of the complex. 
They built the structure using bovine sequence (75-90% similarity for 
known part) since, at the time, the sequence of the chicken complex sub- 
unit was unknown except for cytochrome b (Zhang et al., 1998). Most 
importantly, they found the movement of the ISP extrinsic domain by 
comparison of the native and stigmatellin (a Qp site inhibitor) com- 
plex structure. The structure of the bovine be complex from our group 
followed the structure of the chicken Ae complex (Iwata et al., 1998). 
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This was the first complete structure of the bc; complex. The structure 
provided information about all 11 subunits and revealed that subunit 9, 
the mitochondrial targeting presequence of ISP, exists between two core 
subunits, which are most likely a mitochondrial targeting presequence 
peptidase. We have solved the structures of the bc; complex in two dif- 
ferent crystal forms. Surprisingly, the conformation of the Rieske FeS 
protein was totally different between two crystal forms, and this pro- 
vided a crucial insight of the electron bifurcation mechanism at the Q, 
site (see Section II,F). 

Figure 4 (see color insert) shows the overall structure of the bc, com- 
plex dimer form with a view parallel to the membrane and perpendicular 
to the noncrystallographic twofold axis. Each monomer is related by the 
twofold axis perpendicular to the membrane. The ISP extrinsic domain 
was found in different positions for different crystal forms as mentioned 
above. This difference was found to have a very important physiologi- 
cal implication in electron bifurcation at the Qp site as discussed below. 
Interestingly, the transmembrane helix and the extrinsic domain of ISP 
are associated with different monomers within the dimer. This “domain 
swapping” strongly suggests that the physiological unit of the be com- 
plex is, indeed, dimer. 

Figure 5 is aview of the dimer from the mitochondrial intermembrane 
space. Transmembrane helices, hemes bi, and bu, inhibitors, and lipid 
molecules are shown. Site-specific inhibitors are superimposed in order 
to illustrate the positions of the quinone binding sites. The Qp site, 
characterized by myxothiazol, was located by heme fr. toward the “out” 
side (the mitochondrial intermembrane space side) of the membrane, 
whereas the Qn site, characterized by antimycin A, was found by heme 
by toward the “in” side (the matrix side) of the membrane. 

Interestingly, the Qp site from one monomer is close to the Qy site of 
the other monomer rather than to the Qy site of the same monomer. 
Moreover, this pair of the Qp and Ox sites from different monomers 
exists within the same hydrophobic cavity. The be complex dimer has 
four hydrophobic cavities: two large cavities between the monomers and 
one small cavity within each monomer. The small cavities are occupied 
with firmly bound phospholipid molecules. These lipid molecules are 
from the native mitochrondrial phospholipid membrane, maintained 
with the protein throughout the purification procedure. This means 
these lipids are not exchangeable. Each large cavity maintains only one 
firmly bound lipid molecule, while the rest of the cavity is filled with 
disordered electron densities. These densities could be interpreted as 
disordered detergent molecules, which indicates that the environment 
within the large cavities is highly mobile. This could allow fast exchange 
of the quinone and quinol molecules between the two sites and the 
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Fic. 5. Schematic view of the transmembrane section of the cytochrome be complex 
looking down from the intermembrane space. Transmembrane helices, hemes, bound 
inhibitors, and phospholipids are shown. 


Q-pool outside. The minimum distance between myxothiazol and an- 
timycin A molecules is only 18 A; this indicates that the two sites can 
exchange a quinol/quinone molecule very quickly by simply flipping 
the head group. 


D. Cytochrome b and Quinone Binding Sites 


Cytochrome b is the only subunit of the complex encoded within 
the mitochondrial DNA. Cytochrome b% is located in the center of the 
complex and mostly buried within the membrane. Both the NH: and 
COOH-termini are on the matrix side. Cytochrome b is composed of 
8 transmembrane a-helices («A to aH) and four horizontal a-helices 
(aab, acdl, acd2, and wef) on the mitochondrial intermembrane space 
side (Fig. 6). These helices are connected by four long (ab, cd, de, and ef) 
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Fic. 6. Structure of cytochrome b. Transmembrane helices (A-H), horizontal helices 
(a, cdı, cds, ef), hemes (bj, and bu), and specific inhibitors (myxothiazol, antimycin A) 
are shown. (A) A view parallel to the membrane. (B) A view parallel to the membrane 
normal arrangement of the transmembrane helices is shown. Heme b; is on the front 
and by is toward the back. 
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and 3 short (bc, fg and gh) loops. Hemes Ay and by are maintained 
within a bundle of four a-helices, «A, «B, oC and aD. Hemes by and by 
existon the intermembrane space and the matrix sides, respectively. The 
ligands are His-83, His-182 for heme b; and His-97, His-196 for heme by 
as predicted from biochemical studies (Yun et al., 1991). These histidine 
residues are conserved in all known b type cytochromes (Widger et al., 
1984; Esposti et al., 1993). 

The Qx site was found in a hydrophobic pocket near heme by, formed 
by the residues from gA, aD, ok. and aab, on the matrix side in cy- 
tochrome b (Fig. 6). In the native structure, the Qn site seems empty 
and some water molecules are bond to His-201 and Asp-228. Antimycin 
A was found to be hydrogen bonded to Asp-228, Ser-35, Asn-32. It has 
also van der Waals contact with Phe-18, Tyr-224, Met-190 and a propi- 
onate group of the heme dy. The hydrogen bond network between the 
inhibitor and the side chains in the binding pocket explain the high 
affinity and specificity of this inhibitor. UQ2 binds to a slightly different 
location in the Qx site than the antimycin. Figure 7 shows a proposed 
quinone reduction mechanism at the Qy site based on X-ray structures. 
First, an oxidized ubiquinone molecule from the membrane binds be- 
tween His-201 and Asp-228. Both His-201 and Asp-228 should be proto- 
nated. The first electron transfer from heme by is coupled to the transfer 
of a proton from Asp-228 yielding a ubisemiquinone. To neutralize the 
negative charge, one proton is taken up to Asp-228 through the water 
channel. The formed ubisemiquinone should be stabilized until the sec- 
ond electron transfer occurs. The stabilization mechanism is still unclear, 
although the residues and groups in vicinity of the ubisemiquinone, in- 
cluding His-201 and heme bu, which have direct contact to the quinone 
ring, could play crucial roles. Coupled to the second electron transfer, 
another proton is taken from His-201 yielding a fully reduced ubiquinol. 
To compensate for the negative charge, another proton is taken up to 
His-201 from the matrix space. Finally, the fully reduced ubiquinol is 
released into the membrane. 

The Qp site was found in a hydrophobic pocket near heme br, 
formed by residues from ot. gF, and gef, below acdl, on the inter- 
membrane space side of cytochrome b. No substrate structure in the 
Qp site has been reported. The site has been characterized using var- 
ious inhibitors by a number of different research groups. The Qp site 
inhibitors are spectroscopically divided into two types: class I, inhibitors 
which affect heme by (myxothiazol and MOA type inhibitors); class I, in- 
hibitors which affect both heme by and FeS cluster [stigmatellin, UBDBT 
[5-undecyl-6-hydroxy-4,7-dioxobenzothiazol).] We have succeeded in 
obtaining a class I inhibitor structure (myxothiazol); however, we were 
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Fic. 7. (A) Quinone binding mode at the Qn site. The residues involved in the binding are shown. (B) Possible quinone reduction 
mechanism in the Qy site. 
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Fic. 8. Ubiquinol binding model in the Op site. Ubiquinol was modeled based on the 
inhibitor-bound bc; complex structures. 


unable to obtain the class IL inhibitor complex structure. Based on these 
results, a possible ubiquinol-bound structure was modeled as in Fig. 8. 
A ubiquinol occupies the same domain as myxothiazol with a small shift 
in such a position that it can bind to both His-161 (ISP) and flipped 
Glu-271 (cytochrome b) at the same time. 


E. ISP Extrinsic Domain Movement and Electron Bifurcation 


ISP has one transmembrane helix at the NH»-terminal end the ex- 
trinsic domain at the COOH-terminal end, which are connected by a 
flexible linker with a short helix. A high resolution (1.5 A) structure of 
the extrinsic domain (residues 71-196) has been reported previously 
(Iwata et al., 1996). The domain is composed of 9 6-strands and one 
a-helix and has two subdomains called base and cluster-binding folds. 

The extrinsic domain of ISP was found at different positions in the 
P6522 and P6; forms (Iwata et al., 1998). Various groups have also pub- 
lished different positions of the ISP extrinsic domain (Xia et al., 1997; 
Kim etal., 1998; Zhang et al., 1998; Hunte etal., 2000). These ISP positions 
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are classified into three different groups using FeS-heme cı and FeS- 
heme br, distances (Iwata et al., 1998). The first group is composed of 
the positions observed in P6522 and P2; crystals from bovine and P6522 
rabbit enzymes and native P2,2)2); from chicken enzyme (Zhang et al., 
1998). The ISP position for the group is called “c1,” because the ISP is 
located in close proximity to the cytochrome cı (Fig. 9). In the case of 
bovine P6522, a direct hydrogen bond between a propionate of heme 
cı and His-161 of ISP was observed. The next group contains bovine 
14,22, chicken, and yeast stigmatellin complex forms. The ISP posi- 
tion for this group is called “b” because ISP is in close proximity of 
cytochrome b. In the stigmatellin complex structures, a direct hydrogen 
bond between stigmatellin and His-161 of ISP was observed. Bovine P65 
form is a member of the last group, where the ISP position is called 
“Intermediate Out." Similar positions were reported for the bovine 
14,22 crystal forms with various inhibitors (Kim et al., 1998). In this po- 
sition, the extrinsic domain of ISP stays between “cı” and “b” positions 
on loop ef of cytochrome b. Here, His-161 of ISP forms no hydrogen 
bond. 

This extrinsic domain movement is explained using a combination of 
two rotations: (i) whole domain rotations against the rest of the complex; 
and (ii) the relative rotations within the domain between base and cluster 
binding folds. Between these, the rotation of the whole domain is more 
prominent than the rotation within the domain. Therefore, as a first 
approximation, the conformational change of ISP is well described by 
rotation of the whole extrinsic domain. 

To study the flexible linker region of ISP, which seems to be impor- 
tant for the domain motion, various mutations of the bc; complexes from 
Rhodobacter capsulatus and yeast have been produced and characterized 
by research groups of Daldal and Trumpower, respectively (Darrouzet 
et al., 2000 and Nett et al., 2000). In these studies, both insertion and 
deletion mutants of the linker region caused some loss of the activity; 
however, the insertion mutants showed a more severe loss. The low ac- 
tivity of the mutants is probably due to restriction of domain movement, 
since the electron transfer rate from ubiquinol to ISP is not seriously 
affected in these mutants. Although the effects of the insertion or dele- 
tion could be more complex than just changing the length, the results 
clearly showed the extrinsic domain movement is a well-controlled pro- 
cess and essential for activity. If the movement of the ISP is too slow, the 
rate of enzymatic turnover could be reduced, and if the movement is 
too fast, short-circuit of electron transfer could result in dysfunction of 
the Qcycle (i.e., no electron bifurcation at the Op site). Darrouzet et al. 
also mentioned that the inhibitory effects of a linker insertion mutant 
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Fic. 9. Three different conformations of ISP in relation to its neighbors. (A) Struc- 
ture in the “cı” form. An inhibitor, myxothiazol, bound to the Op site is also shown. 
(B) Structure in “Int” form. (C) Structure in the “b” form. An inhibitor, stigmatellin, 
bound to the Qp site is also shown. 
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could be reversed by a second mutation at Leu-286 (Leu-262 in bovine) 
in the loop ef. It is suggested that the conserved loop ef could be critical 
for rate control of ISP motion. 


F Electron Bifurcation Mechanism at the Qp Binding Pocket 


Based on the positions of metal centers and quinone binding sites 
and the different ISP positions, the electron bifurcation can be well 
explained (Zhang et al., 1998; Iwata et al., 1998, 1999; Crofts et al., 1999). 

Figure 10 shows the proposed ubiquinol oxidation and electron bi- 
furcation mechanism at Qp site. (A) In the absence of the ubiquinone, 
the side chain of Glu-271 is connected to the solvent in the mitochon- 
drial intermembrane space via a water chain. (B) As a reduced ubiquinol 
molecule binds to the site, the side chain of Glu-271 flips to form a hydro- 
gen bond to the bound ubiquinone. (C) Now, the ISP, which is moving 
around the intermediate position by thermal motion is trapped at the “b” 
position by a hydrogen bond to the bound ubiquinone. (D,E) Coupled 
to deprotonation, the first electron transfer occurs. Since the Rieske FeS 
cluster has a much higher redox potential (ca. +300 mV) than heme b; 
(ca. 0 mV), the first electron is favorably transferred to ISP. This yields 
ubisemiquinone, (F,G). After ubisemiquinone formation, the hydrogen 
bond to the His-161 of ISP is destabilized. The ISP moves to the “cı” 
position, where the electron is transferred from the Rieske FeS cluster 
to heme cı. Now unstable ubisemiquinone is left in the Qp pocket. The 
redox potential of the deprotonated ubisemiquinone is assumed to be 
several hundred millivolts. Now the electron transfer to the heme by 
is a “downhill” reaction. (H) Coupled to the second electron transfer, 
the second proton is transferred to Glu-271 and subsequently to the 
mitochondrial intermembrane space. The fully oxidized ubiquinone is 
released to the membrane. 


I. THE STRUCTURE OF CYTOCHROME bog UBIQUINOL OXIDASE FROM 
Escherichia coli 


A. Cytochrome bog Ubiquinol Oxidase 


Cytochrome bog ubiquinol oxidase from £. coli is a four-subunit heme- 
copper oxidase that catalyzes the four-electron reduction of Og to water 
and functions as a proton pump (Puustinen et al., 1991; Fig. 11). All 
redox centers are located within the largest subunit (subunit I), with a 
low spin protoheme (heme b) acting as the electron donor to a binuclear 
center that is composed of an O-type heme (heme 03) and a copper ion 
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Fıc. 10. Possible electron bifurcation mechanism at the Op site. See text for details. 
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Fıc. 11. A schematic illustration of subunits I (SUI) and II (SUII) of cytochrome c and 
ubiquinol oxidases. The proton pathways used for proton pumping and for the delivery of 
protons to the Os binding site (thick black lines) are called the D- and K-pathways (named 
after the conserved residues Asp for the D-pathway and Lys for the K-pathway). Dashed 
lines depict the electron path from the substrate to the Og binding site. In cytochrome 
c oxidases, electrons are delivered to the copper A center (Cua) from the substrate, 
cytochrome c, and then passed onto the low-spin heme (heme a) and finally to the 
binuclear center (heme a3-Cug). In ubiquinol oxidases, electrons are delivered directly 
to the low-spin heme (heme b) by the substrate, ubiquinol, and then to the binuclear 
center (heme o3—Cup). 


(Cus). Subunits I, II, and III of ubiquinol oxidase are homologous to the 
corresponding subunits in the aas-type cytochrome c oxidases, and the 
ligands of the two heme groups and of Cug have been identified as 
the invariant His residues (Lemieux et al., 1992). Subunit IV has little or 
no sequence homology with other oxidases and its function is unknown. 
In contrast to the cytochrome c oxidases, subunit II of ubiquinol oxi- 
dase has no Cu, center, nor does it have a cytochrome c binding site. 
Instead, heme b receives electrons directly from a membrane solubilized 
ubiquinol molecule, and the protons produced are released on the “out” 
side of the membrane. The crystal structure of the ubiquinol oxidase 
from E. coli has been solved at 3.5 A resolution (Abramson et al., 2000). 
The structure confirms that the overall architecture of this complex is 
very similar to that of cytochrome c oxidase. Using site-directed muta- 
genesis and sequence comparisons with known ubiquinone binding sites 
from other membrane proteins, we have identified a novel ubiquinone 
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binding site within this enzyme. These studies reveal how ubiquinol 
oxidase and cytochrome c oxidase can use a similar protein framework 
for oxygen reduction but with two entirely different electron donors. 


B. Overall Structure 


The cytochrome bo; structure is composed of four subunits containing 
25 membrane-spanning helices with a total molecular mass of 130 kDa 
(Fig. 12, see color insert). The overall protein architecture is similar to 
that of the other cytochrome c oxidases solved so far. In fact, cytochrome 
bog can be superimposed over Paracoccus denitrificans cytochrome c oxi- 
dase (Paracoccus-COX) with an rms difference of 1.6 A for 781 Cy atoms 
using a distance cutoff of 3.8 A (Iwata et al., 1995). This is as expected 
since numerous biochemical studies have shown these enzymes to per- 
form the same function in a similar manner. In the following section the 
amino acid numbering will be for cytochrome bog. 

Subunit I of cytochrome bos has 15 membrane-spanning helices. 
There are three additional helices in cytochrome bog that are not present 
in subunit I of the other oxidase structures: one at the N-terminus and 
two at the C-terminus (Chepuri et al., 1990). As expected, the barrel 
structure composed of the twelve membrane-spanning helices (helices 
I-XII) is almost identical to those found in Paracoccus-COX and bovine 
cytochrome c oxidase (bovine-COX, Tsukihara et al., 1995). Both the low- 
spin heme and the binuclear center have identical coordination patterns 
and reside in similar positions in Pores Cand B, respectively. The low-spin 
heme (heme b) is the initial electron acceptor coordinated by His-106 
and His-421. From here, the electrons are passed to the binuclear center 
composed of the high-spin heme (heme 03), coordinated by His-419, and 
the Cug, coordinated by His-284, His-333, and His-334. As expected from 
sequence analysis, there are no nonredox metal ions (ParacoccusCOX 
Mai" /Mn?* or bovine-COX Ca?+/Na*) observed in the structure of cy- 
tochrome bo3. The two polar channels (D- and K-pathways) that were 
found in bovine-COX and Paracoccus-COX are also present in subunit I 
of cytochrome bog (Iwata et al., 1995; Yoshiukawa et al., 1998). Both 
channels maintain a similar arrangement of residues or moieties to fa- 
cilitate proton movement from the cytoplasm to the binuclear center. 
The D-pathway again begins with an aspartic acid (Asp-135), proceeds 
through Asn-124, Thr-211, Asn-142, Asn-124, Tyr-61, Thr-204, Ser-145, 
Thr-201, and Thr-149, and terminates at Glu-286. Likewise, the cavity 
for the K-pathway is formed by Ser-315, Ser-299, Lys-362, Thr-359, the 
OH group of the hydroxyethylfarnesyl tail of heme 03, and Tyr-288. 

Subunit II is composed of two membrane-spanning helices and a 
C-terminal extrinsic domain that is situated above the membrane on 
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the periplasmic side. In the second membrane-spanning helix there is 
a conserved glutamic acid residue (Glu-89) that has a similar conforma- 
tion to its homologue in ParacoccusCOX. Glu-89 has been suggested as 
an alternative input site for regulating proton entry or transit through 
the K-pathway (Ma et al., 1999). Subunit III of cytochrome bog has five 
helices that can be superimposed onto helices III through VII of cy- 
tochrome c oxidase with an rms difference of 1.5 A for 174 Cy atoms 
using a distance cutoff of 3.8 A. A positional alignment places helices 
(0) and (XIV) from subunit I of cytochrome bos over helices I and II 
from subunit III of Paracoccus-COX. Subunit IV of cytochrome bog has 
three membrane-spanning helices that are in contact with both subunits 
I and II. The function of subunit IV is unknown, but it may be important 
in maintaining an intact structure at the binuclear center. The structure 
of cytochrome bog revealed that the third helix from subunit IV interacts 
with helix VII from subunit I, near the binuclear center. 


C. Ubiquinone Binding Sites in Membrane Proteins 


Ubiquinones are energy transducers that are obligatory in many res- 
piratory and photosynthetic electron transport chains. The ubiquinone 
enzymes involved in these reactions usually function in a manner that 
couples the electron transfer by the ubiquinone to proton translocation 
across the membrane.The structural makeup of the ubiquinone active 
site permits varying functional roles that influence the electron and pro- 
ton chemistry. 

Ubiquinone binding sites could be categorized into three different 
types termed donor, acceptor, and pair-splitting sites according to Fisher 
and Rich (2000). In both the donor and acceptor ubiquinone binding 
sites, there are two half-reactions which proceed with near equal po- 
tentials and pass through a stable semiquinone intermediate. In the 
case of the pair-splitting ubiquinone binding site, there is no stable 
semiquinone and two separate electron carriers perform the oxidation 
of the ubiquinol with large differences in redox potentials. It is this 
large difference in redox potential that allows the enzyme to bypass the 
semiquinone intermediate in this ubiquinone site. Although there have 
been more than 50 types of ubiquinone binding sites identified in elec- 
tron transport chains, no real identifiable sequence motif has been ob- 
served except for a general triad assembly of residues on a single a-helix 
flanking the ubiquinone head group (Fisher and Rich, 2000). However, 
a structural motif is beginning to unfold based on the crystallographic 
data available from the available structures (see Section I). 

In both the Qg site of RC and the Qn (Fig. 7) site of the be com- 
plex, a semiquinone intermediate is stabilized. This mechanism allows 


170 MOMI IWATA ET AL. 


the ubiquinone to switch between two distinct one-electron transfers. 
Four contact points between the ubiquinone molecule and the enzyme 
were revealed in a brief structural analysis of these ubiquinone binding 
sites (Murray et al., 1999). The ubiquinone molecule is stabilized in a 
hydrophobic pocket where the ring structure of the ubiquinone is sand- 
wiched between two hydrophobic residues (contact points 1 and 2). The 
distal oxygen (distal from the hydrophobic tail) of the ubiquinone ring is 
always hydrogen bonded with a histidine residue (contact 3). The prox- 
imal oxygen is hydrogen bonded by various hydrogen bond partners 
(contact 4). The approximate dimensions of the binding site is 9.5 A 
from the hydrogen bonding residues (contact point 3 and D and 7.5 A 
between hydrophobic residues (contact points 1 and 2). 

The cytochrome bcı complex maintains a pair-splitting ubiquinone 
binding site, Qp. The Qp site removes the need for the semiquinone in- 
termediate through the bifurcation of electrons to the ISP and heme by. 
Unfortunately, there is no structure containing a ubiquinone at the Qn 
site as mentioned, but complexes with inhibitors have been reported. 
In the presence of different inhibitors the hydrogen bonding pattern 
at the Qp site and position of the ISP changes. Figure 8 shows four 
contact points positioned in a similar manner to the other ubiquinone 
binding sites presented. The binding pocket maintains the sandwich of 
hydrophobic residues (contacts 1 and 2) in the wild type as well as all 
inhibitor structures. Furthermore, a histidine residue from the ISP is hy- 
drogen bonded to the distal oxygen (contact 3) and there is a glutamate 
residue hydrogen bonded to the proximal oxygen (contact 4). 


D. A Novel Ubiquinone Binding Site in Cytochrome bog 


Unlike the other ubiquinol binding sites presented above, the 
ubiquinol binding site in cytochrome bos stabilizes a semiquinone dur- 
ing the oxidation of ubiquinol. A previous model placed the ubiquinone 
binding site of cytochrome bos within the extrinsic domain of subunit II, 
replacing the Cug site (Murray et al., 1999). However, several lines of 
evidence are available to refute this model. 

To begin with, the proposed ubiquinone binding site was unlike any of 
the previous sites that have been described in membrane proteins that 
bind ubiquinone. As presented above the photosynthetic reaction center 
(Deisenhofer et al., 1995), the bc; complex (Xia et al., 1987; Zhang et al., 
1998; Iwata et al., 1998; Hunte et al., 2000), and fumarate reductase 
(Iverson et al., 1999; Lancaster et al., 1999) bind the ubiquinone 
molecule within membrane-spanning helices where the ring of the 
ubiquinone molecule is oriented near the phospholipid head group of 
the membrane. Second, a large mutational study of residues at or around 
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this region in cytochrome bog had little to no effect on the enzyme 
activity (Ma et al., 1998). Third, ubiquinone is dissolved within the 
lipid bilayer and the proposed binding site is clearly outside the 
membrane-associated region. It is unlikely that the ubiquinone would 
leave the favorable environment of the membrane to release electrons. 
Finally, the structure of the soluble fragment of subunit II (Wilmanns 
et al., 1995) is virtually identical to the same domain in the intact 
cytochrome bog with the exception of a shift in a small loop composed 
of residues 209-217. This structural difference is small but important 
as it renders the previously proposed binding site inaccessible. 

It has been proposed that cytochrome bog has two ubiquinone 
binding sites, a high-affinity (Ou) and a low-affinity (Q1) site. The 
low-affinity site is said to function as a pair-splitting site donating one 
electron to heme band the other to the high-affinity site (Sato-Watanabe 
et al., 1994). The Op site maintains a tightly bound ubiquinone when 
the enzyme is solubilized in the detergent dodecyl maltoside (DDM) 
(Sato-Watanabe et al., 1994; Puustinen et al., 1996). Unfortunately, 
crystals could only be obtained using the detergent octyl glucoside 
(OG), under which conditions ubiquinone appears not to bind. Because 
of these difficulties, we have been unable to obtain a cytochrome bog 
structure complexed with ubiquinone or any of its analogues. 

Based on similarities with known structural motifs in membrane 
proteins which bind ubiquinone and biochemical data, a potential 
ubiquinone binding site was located in the membrane domain of 
subunit I of cytochrome bos. The proposed binding site is formed 
from a patch of conserved polar residues located near the surface of 
transmembrane helices I and II. To maintain such an energetically 
unfavorable structure in a hydrophobic environment suggests this 
feature may be functionally important. Sequence alignments show 
that the residues within this polar cluster, Arg-71 and Asp-75 in the 
C-terminal half of helix I, and His-98 and Gln-101 in the N-terminal 
half of helix II, are conserved in the ubiquinol oxidases but not in the 
cytochrome c oxidases (Fig. 13). Subunit I between cytochrome c and 
ubiquinol oxidases shows a high degree of homology but the sequence 
in this particular region is not conserved. 


E. Analysis of Active Site Mutations 


In order to ascertain the functional relevance of this polar cluster, 
site-specific mutagenesis was performed on the residues Arg-71, Asp-75, 
His-98, and Gln-101. The mutations of Arg-71, Asp-75, and His-98 all 
showed a block in enzymatic activity (Abramson et al., 2000). Even the 
most conservative Q101N mutation inhibited activity by 75% and caused 
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Fic. 13. Sequence alignment of membrane-spanning helices I and II in subunit I dis- 
playing the conserved polar residues of cytochrome bos. Five ubiquinol oxidase sequences 
(top) and three cytochrome c oxidase sequences (bottom) are aligned. Residues mutated 
in this work are boxed in light gray and other residues discussed in the text are boxed 
in dark gray. Helices I and II are marked. The histidine ligand of the low-spin heme, 
which is conserved in all heme-copper oxidases, is also shown in a black box in helix II. 
An interhelical loop in the Rhodobacter sphaeroides sequence has been omitted as marked. 
The following sequences were used: (top) E. coli, CYOB_ECOLI (Swissprot) ; Pseudomonas 
putida, BAA76357 (Genbank); Acetobacter aceti, QOX1_ACEAC (Swissprot); Paracoccus 
denitrificans, B54759 (Genbank); Bradyrhizobium japonicum, CO14_BRAJA (Swissprot); 
(bottom) Bos taurus, COX1_BOVIN (Swissprot); P. denitrificans, CX1B_PARDE (Swiss- 
prot); R. sphaeroides, COX1_RHOSH (Swissprot). 


a 10-fold increase in the apparent Ky value for ubiquinol-l. Optical 
spectra of all these mutants showed no substantial perturbations, indi- 
cating no large structural changes near the heme groups. 

Electrochemical studies of the wild-type as well as QIOIN mutant 
enzyme reconstituted in proteoliposomes was monitored by Wikström’s 
group and proved to be a powerful tool for following electron transfer 
(Abramson et al., 2000). The electrometric response of the Q101N mu- 
tant showed that electron transfer from the bound ubiquinol molecule 
was lost. In the presence of bound ubiquinol, the initial fast electrometric 
response on oxidation of the reduced wild-type enzyme by Os can be as- 
signed to the conversion of the enzyme-Os complex (A state) to the 
oxoferryl state (F state). The subsequent slower phase, with equal am- 
plitude to the fast phase, is assigned to reduction of the oxoferryl state 
(F state) by the fourth electron from ubiquinol (O state, Jasaitis et al., 
1999). Without bound ubiquinol, the fully reduced enzyme has only 
three electrons available for oxygen reduction, so the reaction becomes 
truncated at the oxoferryl state (F state, Puustinen et al., 1996). Oxygen 
reduction in the Q101N mutant enzyme stops at this state, indicating 
that normal fast electron transfer from bound ubiquinol is disrupted 
in this mutant. The validity of this method to assess normal binding of 
ubiquinone is confirmed by the control experiment in which bound 
ubiquinone had been removed from the wild-type enzyme with Triton 
X-100 (TX-100, Puustinen et al., 1996). 
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F Modeling of the Ubiquinol Binding Site 


From the structural and biochemical evidence presented above a 
molecule of ubiquinone-2 (UQ2) was modeled into the proposed site. 
The model was aligned in such a way as to allow interaction of the 
ubiquinone ring with the polar residues. Using the program CNS 
(Adams et al., 1997) with energy minimization procedures the optimal 
position for UQ2 was obtained. The UQ2 molecule is positioned in a 
pocket formed from three membrane-spanning helices in subunit I (he- 
lices I, H and III). This site is exposed to the membrane bilayer providing 
direct access for the substrate molecule, ubiquinol. EPR measurement 
supports the modeling of UQ2, placing the ubiquinone ring near the 
surface of the phospholipid membrane at ~70° and ~15° from the plane 
of the membrane (Murray et al., 1999). The distance from the face of 
the ubiquinone ring to heme b is ~13 A permitting fast electron trans- 
fer from ubiquinol to heme b (Fig. 14, see color insert). The contact 
points that form the ubiquinone-binding site (discussed in the previ- 
ous section) can also be seen in this binding site. His-98 (contact 3) 
and the carboxylate of Asp-75 (contact 4) stabilize the oxygen atoms 
of the ubiquinone ring. A similar arrangement has been seen in the 
ubiquinone binding sites of the cytochrome bcı. In cytochrome bos, 
the distance between N,» of His-98 and O;ı of Asp-75 is 11 A, which 
is similar to the distance observed in the Q; site (10 A) of the bovine 
cytochrome bc; complex (Iwata et al., 1998). His-98 is located at the 
N-terminal end of transmembrane helix II and is exposed to the solvent 
in the periplasm, thus allowing direct proton release. The hydrophobic 
residues Ile-102 (contact 2) and also Leu-160 stabilize the ubiquinone 
ring. 

The ubiquinol binding site modeled here appears to be that of the Qu 
site. Optical spectra of the mutant enzymes show a slight redshift in the 
heme b spectrum which has also been observed in the TX-100-treated 
wild-type enzyme (no bound ubiquinol). This indicates that the loss of 
the bound ubiquinone-8 affects the spectrum of heme b and further 
suggests that the ubiquinone-binding site is relatively close to the heme, 
as also concluded by Sato-Watanabe et al. for the On site (Adelroth et al., 
1998). Mutations in this site leads to the loss of activity that could be 
related to changes in the ubiquinone binding site and/or a disruption 
of electron transfer from the site. The steady-state level of reduction of 
heme b was decreased during turnover in all mutant enzymes, which 
shows that the inhibition of enzyme activity is, indeed, due to impaired 
electron transfer from ubiquinol to the heme group. Thus, all current 
mutagenesis data suggest that the site proposed here corresponds to the 
Qu Site. 
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One of the remaining questions about ubiquinone binding in cy- 
tochrome bog is the existence of the low-affinity (Q1) site. There appears 
to be no other reasonable site for the binding of a ubiquinone in this 
structure. However, it may be possible that the site found here is alone 
sufficient for the oxidation of ubiquinol by cytochrome bo3. The binding 
site is directly exposed to the lipid bilayer, which makes the uptake of 
ubiquinol and the release of ubiquinone relatively easy. Following the 
binding of ubiquinol, an initial electron is transferred to heme b and 
two protons are released, forming the semiquinone anion. Multiple hy- 
drogen bonds or electrostatic interaction from the positively charged 
arginine residue could stabilize the semiquinone anion. The stabilized 
semiquinone could then deliver the second electron to heme b at nearly 
the same redox potential (Ingledew et al., 1995). Finally, the oxidized 
ubiquinone can exit the binding site and be replaced by the fully reduced 
ubiquinol. Further studies are required to determine whether one site 
is sufficient or whether two are essential for the function of this enzyme. 


IV. CONCLUSION 


We have summarized the structures of two quinone-binding mem- 
brane proteins in this chapter. Although there are some common 
motifs between the two quinone binding sites, they are quite differ- 
ent overall. This individuality of quinone binding sites has been further 
reinforced by the recent structures of fumarate reductase and formate 
dehydrogenase, which have different sites from any previously solved 
ones. It is interesting to know how many different types of quinone 
binding site in the membrane there are, and if it even makes sense 
to categorize them. This is a very challenging but important question 
for membrane biochemistry. In order to address this question, we are 
currently studying the quinone binding sites in a number of different 
proteins including Complex I and nitrate reductase. 
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I. INTRODUCTION 


Mechanosensitive ion channels are integral membrane proteins that 
open and close in response to mechanical stress applied either directly 
to the cell membrane (in the case of intrinsically mechanosensitive chan- 
nels) or indirectly, through forces applied to other cytoskeletal compo- 
nents (Sachs, 1997; Gillespie and Walker, 2001; Hamill and Martinac, 
2001). Cellular phenomena mediated by mechanosensitive channels 
include touch, hearing, cardiovascular tone, detection of gravity, pres- 
sure sensation, pain perception, and osmoregulation. Mechanosensi- 
tive channels are quite diverse both physiologically and structurally 
(Fig. 1) and have been discovered in all fundamental branches of 
the phylogenetic tree, Eubacteria, Eukarya, and Archaea. Eukaryotic 
mechanosensitive channels include the TRPV subclass of the Transient 
Receptor Potential channel family (Clapham et al., 2001); the TREK-1 
and TRAAK members of the two-pore domain potassium channel family 
Kop (Maingret et al., 1999a,b); and the DEG/ENaC superfamily (com- 
posed of degenerins, eptithelial sodium channels, and acid-sensing chan- 
nels (see Alvarez de la Rosa et al., 2000; Tavernarakis and Driscoll, 2001; 
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Fic. 1. Predicted membrane-spanning topology for mechanosensitive channels found 
in eukaryotes (TRPV, Kop, and DEG/ENaC channels) and bacteria (MscL and Mach). In 
addition to the transmembrane helices (represented as cylinders), other motifs present in 
these channels are designated as follows. The TRPV channels contain several cytoplasmic 
ankyrin domains (A) at the N terminus, and one pore-forming loop (P). Kop channels 
have two pore-forming loops and a self-interaction domain (SID) through which dimers 
are generated. DEG/ENaC sodium channels have a single pore-forming loop and three 
cysteine-rich domains (CRDs). 


Welsh et al., 2002), that have been implicated in the touch response of 
nematodes. Although many of these channels, including the degenerins 
and the yet-to-be identified channel receptors involved in hearing (Gil- 
lespie and Walker, 2001), appear to have an obligatory requirement for 
cytoskeletal coupling, others, such as the TREK-1 and TRAAK channels, 
exhibit intrisinic mechanosensitive activity. Unfortunately, eukaryotic 
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mechanosensitive systems tend to be complex and have yet to be well 
defined biochemically. In contrast, prokaryotic mechanosensitive chan- 
nels can be both relatively simple and intrinsically mechanosensitive. 
Consequently, bacterial channels have been much more amenable to 
biochemical, biophysical, and genetic characterization, and hence can 
serve as models for establishing basic features of mechanosensation that 
may also be relevant to the more complex eukaryotic mechanosensitive 
channels. Reviews of these prokaryotic channels from a variety of per- 
spectives may be found in Sukharev et al. (1997), Batiza et al. (1999), 
Blount et al. (1999), Oakley et al. (1999), Spencer et al. (1999), Martinac 
(2001), and Kloda and Martinac (2002a). 

Prokaryotic mechanosensitive channels have been proposed to func- 
tion in the response of a microorganism to an abrupt transition from a 
high to low osmotic strength environment (Blount et al., 1996a; Sukharev 
et al., 1997; Booth and Louis, 1999; Poolman et al., 2002). The response 
mechanisms of a bacteria to osmotic stress have been best characterized 
in Escherichia coli (see Csonka and Epstein, 1999). During downshock 
conditions, when bacteria are shifted from a high to a low osmolarity 
solution, water enters the cell and generates a large increase in turgor 
pressure. Mechanosensitive channels embedded in the plasma mem- 
branes of bacteria can respond to sudden increases in turgor pressure 
by opening under the most stressful conditions, thereby jettisoning water 
and solutes from the cytoplasm to prevent cell lysis during hypoosmotic 
shock. These channels are often nonselective and allow the passage of an 
astonishing variety of solutes across the membrane, ranging from mono- 
valent ions and water to proteins such as thioredoxin (Cruickshank et al., 
1997; Ajouz et al., 1998). Based on their distinct conductances and activa- 
tion properties, three mechanosensitive channels have been identified 
in E. coli (Martinac et al., 1987; Sukharev et al., 1994; Le Dain et al., 
1998; Levina et al., 1999): the mechanosensitive channels of large con- 
ductance (MscL), small conductance (MscS), and mini conductance 
(MscM). These channels are located in the inner membrane of the bac- 
teria (Berrier et al., 1989). As detailed below, the proteins responsible for 
the MscL and Mach activities have been identified (Sukharev et al., 1994; 
Levina et al., 1999). The presence of multiple mechanosensitive chan- 
nels apparently confers a level of redundancy on the bacterial response 
to osmotic downshock; while knockout mutants in only one of these ac- 
tivities exhibit little phenotype, mutants lacking both MscL and MscS are 
severely compromised during osmotic downshock (Levina et al., 1999). 

In this review, we first introduce some general background considera- 
tions relevant to the description of mechanosensitive channels, followed 
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by a discussion of the best characterized prokaryotic channels, MscL and 
MscS. An important objective of this analysis, still to be fully realized, 
is to identify the protein structural elements that are responsible for 
mechanosensitivity—we would like to be able to answer the question 
“What makes a mechanosensitive channel mechanosensitive?” 


A. Thermodynamic Aspects of Mechanosensitivity 


The consequences of applied membrane tension to an intrinsically 
mechanosensitive channel can be readily evaluated for a simple two- 
state system where the channel can exist in either closed (C) or open 
(O) conformations: 


CeO (1) 


Values for A G°, the standard free energy for channel opening in the ab- 
sence of applied tension, for channels such as MscL are ~40 kJ/mol 
(see Sukharev et al., 1997). If the difference in cross-sectional area, 
AA = Aopen — Actosea, between closed and open states is nonzero, then 
stretching the membrane will increasingly stabilize the state with the 
greater cross-sectional area. Typically, AA > 0, so that stretching the 
membrane opens the channel. Following Howard et al. (1988), the con- 
tribution of membrane tension, o, to the free-energy of channel open- 
ing, AG, may be included through the expression 


AG=AG’-oAA (2) 


This formalism is analogous to the treatments that can be used to incor- 
porate the effects of membrane potential and ligand concentration on 
the free energy of opening for voltage gated and ligand gated channels, 
respectively. 

For this two-state system, the ratio of channels in the closed to open 
states is given by a Boltzmann-type equilibrium expression 


(O) _ _-(A@-sAA/RT 
co. ©) 


where Ris the gas constant (8.3144 J mol"! K`!) and T is the absolute 
temperature. It should be noted that in the electrophysiology literature, 
the properties of single channels are typically under investigation, so 
that the corresponding expression addresses the energetics on a per 
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molecule basis, with the result that kg T appears in place of RT, where 
kp is the Boltzmann constant = R/N4. 

The applied tension, 01/2, required to open half the channels at equi- 
librium is equal to 


AC" 
AA 





(4) 


01/2 = 


The steepness of the response of the channel to applied tension is de- 
termined by the magnitude of the change in cross-sectional area; for 
example, in terms of units typically employed in these calculations, if 
AA = 100 Ä?, then an increase of 1 dyn cm! in membrane tension 
corresponds to a free energy change of —0.602 kJ/mol, and AG will 
scale proportionally to changes in A A. Consequently, for a channel with 
AG? = 40 kJ/mol and 01/2 = 10 dyn cm! (typical for mechanosensitive 
channels such as MscL), AA ~ 660 AN. 

A convenient experimental system for manipulating membrane ten- 
sion involves applying suction (negative pressure) in an electrophysio- 
logical patch clamp experiment, which leads to stretching and curvature 
of the membrane patch (see Sachs, 1997; Hamill and Martinac, 2001). 
For a film such as a membrane patch that separates two compartments, 
the condition of equilibrium reflects the balance between two compet- 
ing effects: the surface energy of the film, oô A, which will decrease as 
the surface area decreases, and a countering pressure differential, AP, 
across the film that will provide a work of expansion equivalent to A Pô V. 
Equating these terms for a spherical film of radius r, where 6A = 8n rdr 
and V = Anr?dr, gives the Laplace equation (Adamson, 1982) that re- 
lates these quantities. 


Apea (5) 
Yr 


Consequently, larger spheres require a smaller pressure differential to 
maintain a given membrane tension than smaller spheres, and vice versa. 
The magnitudes of the pressure differential required to generate spe- 
cific levels of membrane tension can be estimated from the following 
considerations. For a membrane with a radius of curvature = 3 um 
(typical of the dimensions in a patch clamp experiment), application of 
0.1 atm pressure (where 1 atm = 760 mm Hg = 1.013 x 10° dyn cm?) 
corresponds to o = 15 dyn cm™!. The maximum applied pressure de- 
pends on the limit at which the patches break (~20-30 dyn cm!) and 
the curvature of the membrane, but is typically ~0.2 atm. 
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B. Osmotic Pressure 


Although applying suction to a membrane patch is an experimentally 
convenient mechanism for generating pressure differentials and asso- 
ciated tension, it is not a very physiologically relevant mechanism for 
many mechanosensitive systems. For prokaryotes, a more likely situation 
involves experiencing changes in environmental conditions associated 
with changes in osmotic pressure between the outside and inside of the 
cell. These pressure differences can be substantial; for a concentration 
gradient across a membrane, AC, in mol/liter (or more precisely osmo- 
larity, Osm), the osmotic pressure difference, ATI, in atmospheres, at 
25°C is given by the relationship 


ATI = 22.4AC (6) 


so that a concentration difference of 1 M across a membrane gives an 
osmotic pressure difference of 22.4 atm. A concentration gradient of 
only ~0.01 M will produce a 0.2 atm osmotic pressure differential that 
was shown above to rupture membranes with radii ~3 um. 

The response of a bacteria to variations in the external osmolarity has 
been extensively characterized for Escherichia coli (Record et al., 1998; 
Cayley et al., 2000). While optimal growth rates occur in media of os- 
molarity ~0.3 M, cells can grow in medium varying by over 100-fold 
in external osmolarity (~0.02 M to ~3 M). A variety of osmoregula- 
tory mechanisms are utilized by E. coli to minimize the consequences of 
variability in external osmolarity (Csonka and Epstein, 1999). The first 
line of defense is provided by the peptidoglycan outer membrane that 
provides the main physical barrier against osmotically induced swelling 
and shrinking. This cell wall can stretch like a balloon under outwardly 
directed turgor pressure (Doyle and Marquis, 1994; Cayley et al., 2000), 
which is the osmotic pressure difference between the inside and out- 
side of the cell. The turgor pressure has been measured as a function 
of external osmolarity for E. coli and has been found to decrease with 
increasing growth osmolarity, ranging from ~3.1 atm at 0.03 Osm, to less 
than 0.5 atm above 0.5 Osm. The inner membrane that separates the cy- 
toplasmic and periplasmic volumes is considerably more fragile, and, as 
the preceding considerations indicate, can only withstand osmotic pres- 
sure differentials substantially below 1 atm. Consequently, the periplasm 
and cytoplasm are isoosmotic, and the turgor pressure is determined by 
the differences in osmolarities between the periplasm and surrounding 
solution. Since the outer membrane contains porins that are perme- 
able to water and small solutes, while the inner membrane contains 
aquaporins that conduct only water and a very few other solutes, the 
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cell can respond quickly to osmotic pressure differences by transport of 
water (Booth and Louis, 1999). In the case of osmotic upshock, E. coli 
responds by increasing the cytoplasmic concentrations of osmotically 
active solutes and decreasing the water concentration. Osmotic down- 
shock will be associated with a strong driving force for water influx into 
the cell; if this driving force is sufficiently large, the mechanical integrity 
of the cell will be threatened (as in the use of osmotic shock for cell lysis), 
and it is in this capacity that mechanosensitive channels are believed to 
play critical roles. 


C. Helix Packing and Gating Transitions 


For channels such as MscL, the permeation pathway for solutes 
through the membrane is lined by helices. Consequently, changes in 
the permeation pathway associated with conformational gating between 
closed and open states will require changes in helix—helix packing ar- 
rangements. These arrangements may be characterized by a minimum 
pore radius, Ro, which will be relevant to the conductance of the pore. 
For symmetric, oligomeric proteins that have the permeation pathway 
generated by one helix per subunit, Ro will depend on the number of 
helices, N, surrounding the pore, the tilt of the helix with respect to 
the membrane normal, n, and the interhelical crossing angle, a. With 
perfectly regular helices modeled as cylinders of diameter d and exact 
N-fold rotational symmetry, the variation in minimum pore radius with 
helix tilt may be shown to be (Spencer and Rees, 2002) 


Ro = San ncot (3) — J (7a) 


cosa = cos? n + sin? n cos 0 (7b) 


where 


and 0 = 27/N. The dependence of Ro on n is illustrated in Fig. 2A 
for different numbers of subunits in an oligomer. For perfectly regular 
helices and ideal oligomeric symmetry, an increase in pore radius may 
be generated by some combination of two mechanisms involving an in- 
crease in helical tilt 7 and/or or an increase in the number of helices 
N lining the permeation pathway. Although this model is idealized 
and cannot accommodate changes in the diameter of the permea- 
tion pathway due to bending or kinking of the pore forming helices 
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Fic. 2. (A) Dependence of the minimum pore radius on the helix tilt angle for ideal, 
oligomeric channels with one helix per subunit lining the permeation pathway, as calcu- 
lated from Eq. (7) for N = 3, 5, 7, and 10 (thin to thick lines, respectively), with a helical 
diameter of 9 A. The average helical tilt angle = 23° is indicated, as observed in a survey 
of membrane-spanning helices (Spencer and Rees, 2002). The points of intersections of 
the line corresponding to a pore radius = 15 A with the curves for N = 5 and 10 are 
designated by filled circles. The vertical lines from the circle highlight the tilt angles asso- 
ciated with this pore size for these two oligomeric states. (B) Histogram of the observed 
probability (Pn) distribution for the tilt angle between the axis of membrane-spanning 
helices and the bilayer normal, tabulated in 5° bins for proteins surveyed in Spencer and 
Rees (2002). The probability is calculated from the number of helices in each bin, divided 
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(see Unwin, 1995; Jiang et al., 2002a,b), it will, it is hoped, capture the 
major features of the large-scale conformational transitions observed in 
mechanosensitive channels. 

In the absence of applied tension, the axes of membrane-spanning 
helices tend to be oriented along the direction of the membrane normal 
(Bowie, 1997; Spencer and Rees, 2002), which, according to the analysis 
presented in Fig. 2A, will favor the closed state ofa channel. The observed 
distribution of helical tilt angles clusters between 0 and 40° with an 
average value of ~23° (Fig. 2B); in this range, the calculated pore radius 
of a channel (Fig. 2A) is fairly insensitive to the value 7 of the helical 
tilt angle, so that channels should be stabilized in the closed state under 
these conditions. The energetics associated with changing the helical 
tilt can be estimated, in what we hope is not too fanciful a fashion, by 
using a mean-field approximation to extract the tilt energy of helices 
relative to the membrane normal. This involves correcting the observed 
helical tilt distribution, Pops, by that expected for a completely random 
distribution PBA, and then converting this ratio to an energy through 
the relationship AG = —RT In Pops/ Ba (Fig. 2C). A linear fit to these 
data suggests that the free energy change required to increase the tilt of 
a helix from 7 = 20° to 60°, which would be associated with an increase 
in pore radius to ~15 A for N = 5, would be ~8 kJ/mol/helix, or a total 
of ~40 kJ/mol/pentameric channel. For reference, the average thermal 
energy at room temperature is ~2.5 kJ/mol. 


II. MscL: STRUCTURE AND MECHANISM 


Owing to the pioneering efforts of C. Kung, the best characterized 
mechanosensitive channel is the prokaryotic MscL, the mechanosensi- 
tive channel of large conductance. MscL was originally identified, iso- 
lated, and characterized by Kung and co-workers using a biochemical 
approach in combination with a patch-clamp assay to isolate an intrinsi- 
cally stretch activated channel from E. coli membranes (Sukharev et al., 





by the total number of helices in the sample set (139). (C) Estimation of AG for 
helix tilting evaluated from a mean-field analysis through the relationship AG = 
—RT In(Pops/ Peaic), Where Pops is taken from (B), and PA is the expected probabil- 
ity [= cos(@) — cos(0 + A@)] that a randomly chosen line will make an angle with the 
membrane normal between @ and 0 + A9, with A@ = 5° in this calculation. The line 
through these points is of no obvious theoretical significance, but represents an empiri- 
cal, linear fit with the equation AG(n) — 7.0 + 0.17, where AG and n are in kJ/mol and 
degrees, respectively. This relationship is used in the text to estimate free energy changes 
associated with helix tilting. 
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1994, 1997). Homologues of MscL have subsequently been identified 
in other bacteria (Moe et al., 1998) and have been reported in Archaea 
(Kloda and Martinac, 2001a,b,c). MscL is a nonselective ion channel of 
~3.5 nS conductance activated in vitro by the application of membrane 
tension; with a potential difference of 100 mV, this conductance is equiv- 
alent to the flow of ~109 ions/s across the membrane. For reference, 
this conductance level is ~100 times greater than more extensively char- 
acterized channels, such as the potassium channel and acetylcholine 
receptor that have conductances of ~30 pS. Kung and co-workers es- 
tablished that the MscL channel consists of a single type of subunit of 
molecular weight ~16,000 (Fig. 3A), with hydropathy analyses (Fig. 2B) 
indicating that each subunit contains two transmembrane helices. Both 
the NH: and COOH termini have been shown to be cytoplasmic (Blount 
et al., 1996b, 1999; Häse et al., 1997b). In view of the large conductance 
and the small subunit size, MscL was proposed to be organized as a ho- 
mooligomer, now known to be a pentamer (Chang et al., 1998; Sukharev 
et al., 1999a). 

The initial characterizations of the dependence of channel conduc- 
tance on membrane tension for MscL were interpreted in terms of a 
two-state model analogous to Eq. (1) (Sukharev et al., 1997). In amore de- 
tailed analysis, Sukharev, Sachs, and co-workers (Sukharev et al., 1999c) 
established the energetic parameters for the gating transition in the 
E. coli MscL (Ec_MscL) and obtained evidence for three subconduc- 
tance states (S2, S3, and S4) between the fully closed (C1) and fully 
open (O5) states: 


CIl < S2 < S3 © S4 < O5 (8) 





In an unstressed membrane, the C1 state is favored by ~40 kJ/mol rel- 
ative to the O5 state. The open probability exhibits a strong sigmoidal 
dependence on the tension applied to the membrane, with a midpoint 
of ~12 dyn cm™!. Assuming that tension favors the state with the largest 
cross-sectional area in the membrane plane, the increase in area be- 
tween Cl and O5 was calculated to be ~650 Ä?. From the conductance 
and biochemical studies, pore diameters of ~30 Å have been estimated 
(Cruickshank et al., 1997; Ajouz et al., 1998), which suggests that the 
bulk of the increased cross-sectional area of MscL in the open state cor- 
responds to formation of the pore. The functional properties of MscL 
suggest a physiological role in the regulation of osmotic pressure in 
the cell (Sukharev et al., 1997; Levina et al., 1999); since tension near 
the breaking point of the membrane is required to open the channel, 
it is believed that this channel represents a last-ditch safety valve that 
transiently places sufficiently large holes in the membrane to rapidly 
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Fıc. 3. (A) Sequence alignment of five MscL homologues prepared by the program 
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CLUSTAL-W (Thompson et al., 1997). The sequences were selected to be representative 
ofa more complete sequence alignment (see Spencer et al., 1999) and are designated as 
Tb, Ec, St, Yp, and Vc for the MscL homologues from M. tuberculosis, E. coli, Salmonella 
typhimurium, Yersinia pestis, and Vibrio cholerae, respectively. Identical residues in these 
sequences are enclosed by boxes. Residues 130-151 of Tb_MscL have no equivalents 
in the other sequences and have been omitted from this alignment. The pattern of 
conserved Ala and Gly between residues 18-28 is indicated. (B) Hydropathy analysis of 
the M. tuberculosis MscL as calculated with the program TMHMM (Krogh et al., 2001). 
The narrowly spaced vertical lines represent the probability that a given residue is found 
in a membrane spanning helix. The helical segments are predicted to include residues 
13-32 and 69-91, compared to residues 15-43 and 69-89 observed in the crystal structure 
(Chang et al., 1998). The predicted topology of the interhelical loops, either inside or 
outside the cell, is in agreement with experimental observations (Blount et al., 1996b). 


188 STROP ET AL. 


equilibrate osmotic pressure differences. Although other mechanosen- 
sitive channels with lower thresholds are present in E. coli, principally 
MscS, evidence has been obtained that demonstrates MscL does open 
under downshock conditions (Batiza et al., 2002) even when these other 
channels are present. 


A. Structure of MscL 


MscL offers many advantages for structural studies of a gated channel 
system—it has a relatively small subunit size and simple architecture, 
and prokaryotic membrane proteins are generally more easily overex- 
pressed. Although the probability of crystallizing any particular mem- 
brane protein may be low (at present), the probability of success can 
be greatly increased by screening homologous proteins with similar, but 
distinct, amino acid sequences to find one that might be more readily 
compatible with the formation ofa three-dimensional crystal lattice. The 
identification of MscL homologues in numerous prokaryotes, as origi- 
nally reported by Moe et al., (1998), provided an opportunity to search 
for appropriate crystallization conditions using naturally occurring se- 
quence variants that might be more optimal for crystal formation. This 
is the same strategy utilized by Kendrew et al. (1954) for the structure 
determination of myoglobin, except that rather than obtaining different 
types ofwhale meat from around the world, we now obtain the appropri- 
ate DNA. To facilitate this screening process, standardized expression, 
purification, and crystallization protocols were employed to streamline 
the preparation and testing of the various homologues (Spencer et al., 
2002), which culminated in the structure determination of the MscL ho- 
mologue from Mycobacterium tuberculosis (Tb-MscL) at 3.5 A resolution 
(Chang et al., 1998). 

The crystal structure of Tb_MscL established that this protein assem- 
bles as a homopentamer that is organized into two domains, the trans- 
membrane domain and the cytoplasmic domain. The transmembrane 
domain consists of 10 helices (2 per subunit) connected by an extracel- 
lular loop, while the cytoplasmic domain contains 5 helices that form a 
left-handed pentameric bundle. The sequence of the Tb_MscL subunit 
has 151 amino acids and can be further subdivided into five segments: 
the N terminus, the first transmembrane helix (TM1), an extracellular 
loop, the second transmembrane helix (TM2), and a cytoplasmic do- 
main (Fig. 4, see color insert). Each of the segments is discussed in more 
detail below. The pore is aligned along the fivefold symmetry axis and 
is formed by the first transmembrane helix (TM1) and an extracellular 
loop from each subunit. The channel has overall dimensions of approx- 
imately 85 x 50 x 50 A and both the N and C termini reside on the 
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cytoplasmic side of the membrane (Blount et al., 1996b). Unless other- 
wise indicated, the residue numbering used here is that of Tb_MscL. 


1. Amino Terminus 


Despite the fact that the first 10 residues (and the His tag) are disor- 
dered in the crystal, the high sequence conservation of the N terminus, 
together with mutagenesis studies, suggests the importance of the N ter- 
minus in channel function. Deletion of the first two residues of Ec_MscL 
or addition of residues at the N terminus makes little difference in chan- 
nel function; however, the deletion of the first nine residues is not tol- 
erated (Blount et al., 1996c; Häse et al., 1997a). More recent N-terminus 
proteolysis experiments have shown that limited N-terminal truncations 
result in functional channels with increased sensitivity to pressure (Ajouz 
et al., 2000). Sukharev and Guy, in their studies of the gating transition 
in Ec_MscL, have proposed that the N-terminal residues of each subunit 
are organized into a five helix bundle that functions as the actual gate 
(Sukharev et al., 2001a,b) (see below for further discussion). 


2. TM1 Helix 


The first transmembrane helix (residues 15-43) crosses the mem- 
brane from the cytoplasm to the periplasm and, together with the four 
TM1 helices from the other subunits, forms the permeation pathway. 
The interior of the channel on the periplasmic side is mostly hydrophilic 
and is formed by residues Thr-25, Thr-28, Thr-32, Thr-35, Lys-33, and 
Asp-36 (Fig. 4C); the abundance of polar residues toward the C-terminal 
end of the helix is responsible for the shorter predicted helix in the hy- 
dropathy analysis (Fig. 3B). The TM1 helix packs against the adjacent 
TM] helix with right-handed crossing angles of —43° and makes little 
contact with the lipid bilayer. Each TM1 helix is tilted approximately 35° 
with respect to the membrane normal, with the channel constriction 
occurring near the cytoplasmic side of the membrane. The structure 
of Tb_MsclL is believed to be in the closed or nearly closed state, since 
the channel constriction formed by the hydrophobic residues Ile-14 and 
Val-21 is about 2 A in diameter (Fig. 4D). The equivalent residues of 
the Ec_MscL also form the constriction site, based on numerous muta- 
genesis studies (Ou et al., 1998; Yoshimura et al., 1999, 2001; Moe et al., 
2000). A sequence comparison of MscL homologues reveals that many 
of the most highly conserved residues in this channel are found in TM1 
(Fig. 3A), particularly at positions involved in helix—helix contacts and 
in forming the pore constriction (Fig. 5). 

Perozo, Martinac, and co-workers (Perozo et al., 2001) have reported 
a site-directed spin-labeling (SDSL) analysis of Ec_MscL in lipid vesi- 
cles by EPR spectroscopy under physiological conditions. In a SDSL 
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Fic. 5. A helical net representation for the inner TM1 helix of Tb_MscL illustrating 
the positions of helix interface residues, pore residues, and gain-of-function mutations. 
The side chain of Val-21 serves to constrict the channel at the narrowest position. The 
correspondence between residues at the interface between the inner helices and the 
gain-of-function mutations is evident (Ou et al., 1998), suggesting this interface is critical 
for the gating process. Residues 17, 20, 24, and 28 participate in the helix interface with 
one neighboring TM] helix, while 18, 22, and 26 participate in the interface with the 
other neighboring TM1 helix; if these residues are found in subunit A, then the former 
group interacts with subunit E, and the latter with subunit B. 


experiment, as pioneered by W. Hubbell (Hubbell et al., 1998), a spin 
label is covalently attached to a series of single cysteine mutants in- 
troduced throughout the region of interest in a protein (such as a 
membrane-spanning helix). From the accessibility of the spin label to 
various reagents, and from the presence or absence of spin-spin interac- 
tion between probes, the environment and relative location of the spin 
label can be deduced. Specifically, a high accessibility of the spin label 
to the nonpolar dioxygen molecule is indicative of a hydrophobic envi- 
ronment, while exposure of the spin label to the polar species NiEdda is 
indicative of exposure to an aqueous environment. In addition, analysis 
of the spectral line shape gives information about spin label mobility. 
From the variation in these properties with sequence position, informa- 
tion on the protein structure and environment can be obtained. In the 
case of Ec_MscL, cysteine mutants were generated for residues in trans- 
membrane helices TM1 and TM2, and subsequently modified with a 
methanethiosulfonate spin label. In general, the structure and environ- 
ment of labeled residues established by this study correlated well with 
the Tb_MscL structure. In particular, severely restricted accessibility to 
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NiEdda was observed for residues 19-26, which in the crystal structure 
create the channel constriction, while strong periodicities in the mo- 
bility and accessibility of the label were seen for residues 19-38 that 
indicate a helical structure. The periodicity of NiEdda and Os accessi- 
bilities are 180° out of phase, suggesting that one face of the helix is 
water accessible, while the opposite face of the helix is in a hydropho- 
bic environment. This is in good agreement with the crystal structure, 
where one face of the TM1 helix forms the water-accessible pore and the 
opposite face packs against surrounding helices and therefore resides in 
a hydrophobic environment. 


3. Extracellular Loop 


The connection between the TM1 and TM2 helices is mediated by the 
extracellular loop formed from residues 44-68, which is poorly ordered 
in the crystal structure. This extracellular loop dips into the channel and 
lines this region of the pore to form an outer-lip of diameter ~20 A. The 
high glycine content of the extracellular loop (Gly-47, Gly-55, Gly-60, 
Gly-62, Gly-63, and Gly-64 in Tb_MscL) likely contributes to the flex- 
ibility seen in this region of the crystal structure (Fig. 4B). The high 
sequence variability and structural flexibility of this region might argue 
against an important functional role; however, proteolysis experiments 
have demonstrated that cleavage of the extracellular loops makes it eas- 
ier to open the channel (Ajouz et al., 2000), while mutagenesis studies 
have shown that substitutions in the extracellular loop can lead to a 
gain-of-function phenotype (Maurer et al., 2000). 


4. TM2 Helix 


The second transmembrane helix (residues 69-89) traverses the lipid 
bilayer from the periplasm back to the cytoplasm along the outside of 
the TM] helix bundle, and is consequently responsible for most of the 
contact with the lipid bilayer. The lipid exposed surface of MscL is com- 
posed of approximately 35% TM1 and 65% TM2. The face of TM2 that 
comes in contact with the lipid bilayer is lined with many hydrophobic 
residues (Leu-69, Leu-72, Leu-73, Ile-77, Phe-79, Phe-80, Leu-81, Phe-84, 
and Phe-88) and is more hydrophobic than an average protein core 
(Rees et al., 1989; Wallin et al., 1997; Spencer and Rees, 2002). The TM2 
helix, like the TM1 helix, is tilted about 35° with respect to the mem- 
brane normal; however, unlike the TM1 helices, the TM2 helices from 
different subunits are separated by ~20 A and do not contact each other. 

SDSL studies of the TM2 helix (Perozo et al., 2001) revealed that it is 
slightly less motionally restricted, relative to TM1. This is to be expected, 
since TM2 is less tightly packed than TM1, because of the accessibility to 
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the solvating lipid bilayer. As with TM1, the periodicity of oxygen accessi- 
bility is in general agreement with the crystal structure, where one side of 
the TM2 faces the lipid bilayer. NiEdda accessibility decreases as the TM2 
helix enters the membrane, and sharply increases at residue 100 where 
it presumably exits the membrane. The spin labeling data deviate from 
the crystal structure at the C terminus of the TM2 helix, where the local 
environment derived from EPR studies does not match the environment 
derived from the crystal structure. These differences may reflect some 
combination of the presence of partially ordered electron density near 
the C terminus of the TM2 helix (corresponding to ordered detergent 
molecules or part of the disordered polypeptide chain), structural dif- 
ferences between E. coli and M. tuberculosis homologues or differences 
arising from the use of bilayers and detergent to solubilize Eco_MscL 
and Tb_MscL, respectively. 


5. Cytoplasmic Domain 


After the TM2 helix returns to the cytoplasm, a second loop connects 
the TM2 and cytoplasmic helices. The cytoplasmic helices from all five 
subunits (residues 102-115) associate together to form a left-handed 
helix bundle. Mutagenesis and proteolysis data have shown that the 
C terminus of Ec_MscL can be removed up to residue 104 without any 
change in gating properties of MscL (Blount et al., 1996c; Hase et al., 
1997a; Ajouz et al., 1998). A cluster of charged residues (Arg-98, Lys-99, 
Lys-100, Glu-102, and Glu-104) is present in the loop connecting TM2 to 
the cytoplasmic helix (Fig. 4E) and has been shown to be important for 
channel gating (Sukharev et al., 1994, 1997; Hamill and Martinac, 2001). 
The overall negative charge of the C-terminal domain and the relatively 
low pH (3.7) used for crystal growth have led to the hypothesis that the 
cytoplasmic helix bundle is not present at physiological pH (Chang etal., 
1998); however, there is no experimental evidence for the loss of helic- 
ity at high pH (Strop, 2002), suggesting that the bundle remains intact, 
or that the helices partition into the membrane-water interface region 
under these conditions. The rest of the C terminus (residues 116-151) 
does not show any ordered structure in the crystal and corresponds to 
the least conserved region of the protein. 


B. Gating Mechanism: General Considerations 


In view of the substantial conductance and solute nonselectivity of 
MscL, the general picture that emerges for the open state of this chan- 
nel is of a large, water-filled pore with a likely diameter of up to 30 Å. 
Assuming that the fivefold symmetry of MscL is maintained in both states, 
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generation of a permeation pathway with these properties in the open 
state would require a significant rearrangement of all subunits in re- 
sponse to application of membrane tension (Blount et al., 1996a). Given 
that the membrane-spanning region of the channel is constructed from 
a-helices, a change in subunit-subunit interactions will necessarily re- 
quire changes in the packing of helices against each other. Following the 
analysis presented at the beginning of this chapter, there are two basic 
mechanisms for increasing the minimum pore radius created by a set 
of symmetrically interacting helices: either the helical tilt must increase, 
or the number of helices surrounding the permeation pathway must in- 
crease. For the specific case of MscL with n ~ 35° in the closed state, the 
transition to an open state with a minimum diameter of 30 A (radius = 
15 A) could be achieved either by increasing the tilt of the five TM1 he- 
lices to ~70°, or by incorporating the TM2 helices to create a permeation 
pathway surrounded by 10 helices, each with a tilt of ~40° (Fig. 2A). 
An important development in identifying residues critical to the gat- 
ing transition has been the isolation of “gain-of-function” mutants that 
display a slow or no-growth phenotype as a result of the leakage of so- 
lutes out of the cell under low osmotic strength growth conditions (Ou 
et al., 1998). In vitro characterization of these mutant channels demon- 
strated that they generally exhibit a reduction in the tension required 
for channel gating, suggesting that the closed state in these mutants 
is destabilized relative to the wild-type channel [i.e., the equilibrium is 
shifted to the right in Eq. (1)]. Many of the mutations associated with 
severe phenotypes are located at the interface between adjacent TM1 
helices (Fig. 5), in the region of the membrane-spanning domain where 
the pore is most restricted. Three of the residues with the most severe 
phenotypes, corresponding to positions 20, 24, and 28 of Tb_MscL, are 
typically found to be glycine in the different MscL homologs (Fig. 3A). In- 
tringuingly, glycines are frequently found at the interface between trans- 
membrane helices (Javadpour et al., 1999; Eilers et al., 2000; Russ and 
Engelman, 2000), and these residues of MscL are among the most highly 
conserved positions. An extensive mutagenesis study of residue Gly-22 
in E. coli MscL (equivalent to Ala-20 in M. tuberculosis MscL), which is at 
the interface between adjacent inner helices, revealed that hydropho- 
bic and hydrophilic substitutions stabilized the closed and open states 
of the channel, respectively, suggesting that this residue becomes ex- 
posed to a water-filled channel in the open state (Yoshimura et al., 1999). 
In experiments where cysteine residues were substituted for channel- 
constricting residues (Yoshimura et al., 2001), reversible modification of 
hydrophilicity was possible under patch clamp. Chemical modifications 
were consistent with the mutagenesis studies, resulting in functional 
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channels when hydrophobic modifying agents were used, and resulting 
in spontaneous gating when hydrophilic modifying agents were used. 
These observations suggest that contacts between TM1 helices play a 
crucial role in the gating mechanism, and, particularly, changes made 
to residues at this interface tend to destabilize the closed state and make 
it easier to open the channel. 

Although the most severe phenotypes identified in the initial “gain- 
of-function” screen were found in the membrane-spanning helices (Ou 
et al., 1998), residues in other regions throughout the protein were 
also identified that influenced the gating behavior of MscL. Proteolysis 
of the extracellular loop between the membrane-spanning helices in 
Ec_MscL has been found to significantly increase the mechanosensitivity 
of the channel, without changing conductance (Ajouz et al., 2000), 
and mutagenesis studies in this region of Tb_MscL have also picked up 
gain-of-function mutants (Maurer et al., 2000). Variants at position 56 of 
Ec_MscL, also in the extracellular loop, lead to significant alterations in 
open time (Blount et al., 1996c). On the other side of the membrane, 
deletion of either the first 12 or last 33 residues of Ec_MscL results in loss 
of channel activity, although deletions of the first 3 or last 27 residues 
can be tolerated (Blount et al., 1996c). Intriguingly, these sensitive 
regions include relatively well-conserved clusters of charged residues 
that have been suggested to be important for the gating mechanism 
(Gu et al., 1998). 

At present, most of the mutational studies have been conducted 
in E. coli, whereas the structure has only been determined for the 
M. tuberculosis MscL. The emphasis on functional studies of Ec_MscL 
reflects not only that this was the first MscL to be characterized, but also 
that opening Tb_MscL requires tensions near the breaking point of the 
patch (Moe et al., 2000), and hence Tb_MscL is technically more diffi- 
cult to study than Ec_MscL. Although the structures of these two homo- 
logues are undoubtedly similar, particularly in the membrane-spanning 
regions with the highest degree of sequence conservation, differences in 
the properties of corresponding mutations constructed in Ec_MscL and 
Tb_MscL have been reported and interpreted as indicating there may be 
mechanistic distinctions between these two homologues (Maurer et al., 
2000; Moe et al., 2000). Clearly, more extensive functional and structural 
studies are required to establish the extent of the mechanistic similarities 
in the gating of these two channels. 


C. MscL Open State 


The earliest structural models for the open state of MscL emphasized 
the involvement of all the membrane-spanning helices of MscL to create 
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the permeation pathway (Cruickshank et al., 1997; Spencer et al., 1999). 
In particular, an alteration of the TM1 and TM2 helices was suggested 
to create a 10-helix “barrel-stave” type channel with the a-helices nearly 
parallel to the membrane normal in the open state (Spencer et al., 1999). 
The conformational rearrangement associated with this transition could 
plausibly involve the coordinated movement of adjacent pairs of TM] 
and TM2 helices that are nearly antiparallel, since these types of helix- 
helix interfaces tend to be more stable (Gerstein et al., 1994). From 
the extent of buried surface area, the most stable TM1-TM2 interface 
would involve TM1 from subunit A with TM2 of the neighboring sub- 
unit B (and the symmetry equivalents) that have a antiparallel crossing 
angle of —170° and a total buried surface area of 700 Ä?. This interface 
exceeds the 540 A? surface area buried between TM1 and TM2 in the 
same subunit with a crossing angle of 138°. Attempts to generate ex- 
plicit models for a “barrel-stave” type channel have indicated, however, 
that a substantial number of hydrophobic residues would be exposed 
to water in the open state, which would be energetically unfavorable 
(Sukharev et al., 2001b). An alternative mechanism for increasing the 
number of helices surrounding the permeation pathway in the open 
state, by increasing the number of subunits in the oligomer, appears un- 
likely in view of Sukharev’s observations that the MscL oligomer is not 
in dynamic equilibrium with monomers (Sukharev et al., 1999b). 

A conceptual breakthrough in the modeling of the open state of MscL 
has been reported by Sukharev and Guy, using modeling studies com- 
bined with disulfide trapping and electrophysiological characterizations 
(Sukharev et al., 2001a,b; Betanzos et al., 2002). This model is based on 
the second general mechanism for generating the open state from the 
closed state, namely through a substantial increase in helical tilt (Fig. 6). 
The symmetry of MscL is assumed to be conserved in both closed and 
open states. The driving force for this rearrangement is likely to main- 
tain the lipid solvation of the membrane-spanning helices in response 
to a stretch-induced thinning of the bilayer (Hamill and Martinac, 2001; 
see below for a more extensive discussion of the role of hydrophobic 
mismatch in MscL gating). In this model, the structural rearrangements 
for forming a large, ~30 A pore in the open state involve increasing the 
helical tilt from 35° (closed) to 70° (open); the TM1 helices line the 
permeation pathway in both states. In the open state model, the inter- 
helical crossing angle between adjacent TM1 helices changes from —43° 
to —68°. Although both the tilt and the helix crossing angle are essen- 
tially unprecedented in structurally characterized membrane proteins 
(Fig. 2B), they are proposed to exist in a form that is significantly less 
stable than the closed state under resting conditions; the estimate from 
the mean-field approximation described in Fig. 2C suggests an energetic 
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open state model 





Fıc. 6. Top: Comparisons of the structures of MscL as observed crystallographically 
in the closed state (Chang et al., 1998), and bottom: as modeled in the open state by 
Sukharev and Guy (Sukharev et al., 2001a,b). Views down the membrane normal (right) 
and in the plane of the membrane (left) are provided. One subunit is highlighted in 
each structure for clarity. The increased tilt of the membrane-spanning helices in the 
open state model, along with the increased helical length necessary to span the bilayer 
in the open state model, is evident. The dark lines indicate 40 A. 


cost of ~6 kJ/mol/helix to increase the helical tilt by 35°. Rather than the 
side chains of Val-21 in the inner helices serving as the gate, Sukharev and 
Guy propose that the real gate is a five-helix bundle containing the highly 
conserved amino-terminal residues of each subunit. These residues are 
not observed in the crystal structure, however, presumably because of 
disorder or other types of conformational heterogeneity. Experimental 
support for the Sukharev and Guy model is provided by the ability to sta- 
bilize an open state of MscL through disulfide bond formation between 
specifically incorporated cysteines that should be spatially adjacent only 
in the open state. Disulfide cross-linking studies have demonstrated that 
by cross-linking the N terminus to the C-terminal end of the TM2 helix 
(residues 1 and 91), the channel rarely reaches the fully closed state; 
instead, it flickers mostly between the open state and an intermediately 
open substate (Sukharev et al., 2001a). 

As implemented by Sukharev and Guy, the combination of patch- 
clamp measurements with disulfide trapping provides a sensitive ap- 
proach for testing models of the open state by characterizing the 
consequences of disulfide bond formation between specific cysteines on 


PROKARYOTIC MECHANOSENSITIVE CHANNELS 19 7 


the closed-to-open transition. The ability to monitor the conductance 
of single channels is a great advantage over bulk phase measurements, 
since the effect of the presence or absence of a disulfide bond on 
channel function can be directly analyzed, even if the overall equi- 
librium strongly favors one state or the other. While the ability to 
form a disulfide bridge between two cysteines clearly requires that 
these residues be in spatial proximity at the time of bond forma- 
tion, the restrictions on the time-average structure imposed by this 
technique can be relatively weak. Since disulfide bond formation is 
an effectively irreversible trapping technique, disulfides can form be- 
tween pairs of distant cysteines if they are brought into proximity by 
even infrequently occurring conformational fluctuations. This behav- 
ior has been detailed in studies on water-soluble proteins (Careaga 
and Falke, 1992; Butler and Falke, 1996), where cysteines separated 
by up to 15 Ä in the time average structure can still be disulfide 
trapped. 

In view of these considerations, a detailed, experimentally based 
model of the open state of MscL will likely require crystallographic and 
spectroscopic approaches. The main barrier to studying the open state 
of MscL using these techniques is that they require a large number of 
molecules to obtain an experimentally recordable signal; as the equilib- 
rium strongly favors the closed state in the absence of applied tension, 
these observations will consequently reflect the properties of the closed 
state. The challenge of stabilizing the open state of MscL is shared with 
other protein systems, such as allosteric enzymes, that exist in multiple 
conformations and must be dealt with by establishing conditions where 
the alternate conformation dominates. Stabilization ofa state disfavored 
at equilibrium may be achieved, for example, through the addition of li- 
gands that bind preferentially to that state (allosteric effectors), through 
the use of mutants that stabilize the desired state (or equivalently desta- 
bilize the undesired states), or by covalently trapping the desired state. 
Unfortunately, in contrast to many medically relevant channels, no state- 
specific ligands have been identified that could be used to trap MscL in 
an open state. Mutants that favor the open state of MscL are known; while 
the equilibrium is shifted toward the open state in the gain-of-function 
mutants isolated by Kung and co-workers (Ou et al., 1998), these mu- 
tants are not locked in the open state in the absence of applied tension. 
Hence, these mutants, as isolated, are not suitable for structural studies 
of the open state. Finally, for reasons noted above, while trapping meth- 
ods could be potentially used to stabilize an open state, it is difficult to 
find conditions that will generate sufficient quantities of homogenous 
material that captures the ground-state structure of the open state. 
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Perozo and Martinac (Perozo et al., 2002a,b) have reported a major 
experimental breakthrough in the structural characterization of the 
MscL open state by using SDSL methods in conjunction with solution 
conditions that stabilize the open state at equilibrium. This advance 
was based on an analysis of potential coupling mechanisms between ap- 
plied membrane tension and the conformational state of MscL. Because 
of differences in the relative cross-sectional areas of the head-group 
and hydrocarbon-chain components of phospholipids, substantial lat- 
eral pressures are generated within a membrane bilayer (Gruner, 1985; 
Cantor, 1997, 1999; de Kruijff, 1997) that will act on integral membrane 
proteins. When a membrane is deformed, the lateral pressure profile 
will necessarily be altered, with consequent changes in the conforma- 
tional energetics of any embedded membrane protein, such as MscL, 
that possesses multiple states of differing cross-sectional area. Perozo 
and Martinac evaluated two possible mechanisms for the transduction 
of mechanical energy derived from membrane stretching to shifting the 
conformational equilibrium toward the open state of MscL: a decrease 
in bilayer thickness and an increase in bilayer curvature. 

Changes in bilayer thickness could alter the equilibrium between con- 
formational states of a channel through hydrophobic mismatch effects, 
where the ability of a membrane to solvate the nonpolar region of a 
protein will change when the membrane thickness is varied. To assess 
the contribution of bilayer thinning when the membrane is stretched to 
driving the open state transition, Ec_MscL was reconstituted into vesi- 
cles formed from phosphatidylcholine of varying acyl chain lengths 
(Perozo et al., 2002b). For chain lengths of 16, 18, and 20 carbons 
(designated PC16, PC18, and PC20, respectively), reconstituted MscL re- 
quired greater applied tension for opening with increasing chain length, 
with AG° values measured to be 10, 24, and 59 kJ/mol, respectively. 
Despite the differences in pressure dependence, the conductances of 
the channel were essentially constant under all conditions. However, 
under no condition could the equilibrium be shifted to predominantly 
favor the open state, a result corroborated by SDSL studies with MscL 
containing spin labels positioned near the constriction in TM] recon- 
stituted into vesicles composed of PC10 to PC20. Thus, while bilayer 
thinning/hydrophobic mismatch effects do influence the energetics of 
MscL gating, they are insufficient to preferentially stabilize the open state 
under equilibrium conditions. However, evidence for a distinct, closed 
state form of MscL was observed in vesicles reconstituted with PC14. 

The possibility that changes in bilayer curvature might be implicated 
in the gating of mechanosensitive channels was motivated by the obser- 
vation that addition of amphiphiles to one side of the membrane can 
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lower the activation tension required for channel opening (Martinac 
et al., 1990). The basic idea is that if non-cylindrically shaped molecules, 
such as the cone-shaped amphiphile lysophosphatidylcholine (LPC) that 
contains a “big head” relative to the small acyl chain, are incorporated 
asymmetrically into membranes, geometric considerations will lead to 
mechanical deformation (curvature) of the bilayer to optimize packing 
throughout the membrane leaflet. This induced curvature will necessar- 
ily alter the lateral pressure profile throughout the membrane, which 
will consequently perturb the conformational equilibrium of proteins 
exhibiting multiple states of different cross-sectional areas. Using SDSL 
probes incorporated into residues near the constricted region of TMI, 
Perozo and Martinac found that an open state conformation of MscL 
could be stabilized under equilibrium conditions by the addition of 
~10-25 mol% LPC from solution to the externally facing leaflet of 
the bilayer. 

By establishing conditions that stabilize the open state of MscL, a sys- 
tematic analysis of the conformation and environment of residues in the 
TM1 and TM2 helices could be conducted using the SDSL approach 
(Perozo et al., 2002a). The results of this study clearly establish that 
(1) the permeation pathway is lined primarily, if not exclusively, by 
residues in the TM1 helix; (2) the TM2 helix remains in contact with the 
lipid bilayer, but the interactions between TM2 and TM] shield the latter 
from the extensive interactions with the membrane; and (3) equivalent 
residues in the TM] helices of different subunits move sufficiently far 
apart that they no longer exhibit spin-spin coupling that is apparent 
for residues near the channel constriction in the closed state. A struc- 
tural model for the MscL open state has been generated that is in good 
agreement with the constraints established by the SDSL analysis, which 
indicates that the pore diameter is >25 A. As anticipated by the Sukharev 
and Guy model (Fig. 6), the open state is derived from the closed state by 
a substantial increase in tilt of the TM1 and TM2 helices that maintains 
the right-handed packing interactions between TM1s. Further develop- 
ments in the structural characterization of this system, both within and 
external to the membrane bilayer, will be very informative in defining 
the gating mechanism of MscL. 


D. Computational Studies 


In addition to experimental studies, the combination of relative sim- 
plicity and structural characterization have made MscL an attractive 
target for computational molecular dynamic simulations of channel gat- 
ing (Elmore and Dougherty, 2001; Gullingsrud et al., 2001; Bilston and 
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Mylvaganam, 2002; Kong et al., 2002). These studies have highlighted 
regions of relative structural stability found in the membrane, particu- 
larly near the channel constriction, as well as more poorly defined re- 
gions involving residues in the extracellular loop and carboxy terminus. 
Calculations on the closed state of wild-type MscL have been extended to 
include evaluation of the sensitivity of the channel structure and dynam- 
ics to mutations, and, of particular mechanistic interest, the response 
of the channel to applied tension (Gullingsrud et al., 2001; Bilston and 
Mylvaganam, 2002). In one study, the trajectory of closed to open state 
developed by Sukharev and Guy has been modeled by targeted molec- 
ular dynamics (Kong et al., 2002), which suggests that the intermediate 
open states are asymmetric. Computational analyses would seem to hold 
particular potential for establishing the coupling between membrane 
and protein structures as a function of applied tension, and in struc- 
turally defining the progression of intermediates between closed and 
open states. 


III. Mech AND OTHER PROKARYOTIC MECHANOSENSITIVE CHANNELS 


A. MscS 


Following the identification of E. coli MscS in 1987 (Martinac 
et al., 1987), this channel has been electrophysiologically characterized 
(Sukharev et al., 1997; Blount et al., 1999) and shown to have a con- 
ductance of approximately 1 nS, roughly one-third that of MscL, with 
a pressure threshold for channel opening approximately 60% that of 
MscL. From a fit to the Boltzmann equation [Eq. (3)] the closed-to- 
open transition for Mech may be characterized by AG? = 28 kJ/mol, 
AA = 840 Ä?, and the midpoint tension is 5-6 dyne cm! (Sukharev, 
2002). From the observed conductance, a pore diameter of approx- 
imately 18 A is estimated for the open state of MscS. Unlike the MscL 
channel, MscS exhibits a voltage-dependent gating. On reaching its pres- 
sure threshold, there is an e-fold increase in open probability per 15 mV 
of membrane depolarization (positive voltage) (Martinac et al., 1987). 
At moderate voltages (+20 mV), MscS has slow kinetics, dwelling in an 
open or closed state for up to seconds (Sukharev et al., 1997). MscS is 
mostly nonselective with a slight anion selectivity (Kloda and Martinac, 
2002a; Sukharev, 2002). 

Booth and co-workers established that the yggB gene was associated 
with MscS activity in membrane patches of E. coli (Levina et al., 1999). 
As shown by a series of yggB knockout strains of E. coli, the yggB gene 


PROKARYOTIC MECHANOSENSITIVE CHANNELS 201 


was required for MscS channel activity. Whereas deletion of either the 
mscL or yggB gene appears to be without a phenotype, mutants lacking 
both genes are hypersensitive to osmotic downshock. This finding sug- 
gested that each channel can compensate for the other in responding 
to osmotic stress. KefA [subsequently renamed Mack (Li et al., 2002)], 
a protein found by homology to contain an MscS-like domain, also dis- 
plays MscS-like activity, but is dispensable. Deletion of the kefA gene in 
concert with either the mscS gene or the MscL gene was found not to 
have any phenotype. Although it appears that kefA is not required for 
the response to hypoosmotic shock, it contains a domain highly homolo- 
gous to yggB at the carboxy-terminal end of the protein and does encode 
a channel of similar conductance to MscS (~1.0 nS). In contrast to the 
slight anionic preference observed for the MscS activity associated with 
yggB, a mutational analysis has suggested that KefA is a cation-specific 
channel involved in osmotic adaptation in E coli (McLaggan et al., 2002). 

Whereas homologues of MscL range from around 130 to 150 amino 
acids, MscS homologues are typically at least twice that size (Fig. 7A). By 
separating E. coli MscS and MscL channels on a size exclusion column 
and subsequently analyzing the resulting fractions by patch clamp, it was 
demonstrated that MscS and MscL activities can be resolved (Sukharev 
et al., 1993). MscS elutes from the column first, indicating that the func- 
tional channel is larger than MscL. Based on the size exclusion chro- 
matography, the fully assembled MscS channel was estimated to have a 
molecular weight of 200,000-400,000, in comparison to 70,000 for MscL. 
From these studies, MscS is expected to be a homomultimer (Sukharev 
et al., 1993), with cross-linking data suggestive of a hexamer (Sukharev, 
2002). 

Hydropathy plots of MscS indicate that the N-terminal half of the pro- 
tein is highly hydrophobic (Fig. 7B). Transmembrane helix predictions 
programs such as TMHMM (Krogh et al., 2001) predict three transmem- 
brane helices for E. coli MscS (Fig. 7B), followed by a large (160 amino 
acids) soluble domain that is predicted to be cytoplasmic on the basis of 
PhoA fusion studies on KefA (McLaggan et al., 2002) and from the se- 
quence properties identified in the hydropathy analysis (Fig. 7B). Predic- 
tions for the larger Mach proteins from Bacillus subtilis and Methanococcus 
jannaschii suggest two additional transmembrane helices are present be- 
yond those in E. coli MscS. From sequence comparisons (Fig. 7A), regions 
with a high degree of sequence conservation are found throughout the 
entire yggB protein. Itis striking, however, that residues in the region cor- 
responding to the third predicted transmembrane helix are especially 
well conserved. It has not been established which membrane-spanning 
helix forms the permeation pathway, although mutants that alter the 
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channel properties have been selected in the first (Okada et al., 2002) 
and last (McLaggan et al., 2002) membrane-spanning helices of YggB 
and KefA, respectively. 


B. MscM 


A third mechanosensitive channel of mini conductance (MscM) has 
been functionally identified in E. coli (Berrier et al., 1989, 1996), al- 
though the encoding gene has not yet been reported. This channel has 
a conductance of less than half that of MscS (0.1-0.4 nS), with slow 
opening/ closing kinetics relative to Mach, and especially MscL (Berrier 
etal., 1993). In contrast to MscL and Mach, MscM has pronounced cation 
selectivity. 


C. Archaeal MscS-like Proteins 


Once the genes encoding MscL and MscS were identified, it became 
possible to search genomic databases for homologous channels in a 
wide array of prokaryotes. The results of these sequence searches sug- 
gest that MscS may be more widely distributed in prokaryotes than MscL. 
Unlike MscL, Mach homologues appear to be prevalent in Archaea as well 
as in at least some eukaryotes. Kloda and Martinac (2001b,c, 2002a) 
identified and characterized two archaeal mechanosensitive channels 
that were reported to be similar to both MscS and MscL. Using the 
first transmembrane helix of MscL as a sequence probe, they identi- 
fied an apparently homologous channel in the archaea Methanococcus 
jannaschii, termed MscMJ. Using this channel as a search criterion, 
they subsequently found a second mechanosensitive channel MscMJLR. 
Although the original search was performed using TM] of MscL, both 
channels are homologous to Mach, On the basis of these sequence simi- 
larities, prokaryotic mechanosensitive channels have been proposed to 
be evolutionarily related (Kloda and Martinac, 2002b), although this 





Fic. 7. (A) Sequence alignment of four MscS homologues prepared by the program 
CLUSTAL-W (Thompson et al., 1997). The sequences correspond to MscS homologues 
from the organisms included in Fig. 3A for MscL, with the exception of M. tuberculosis, 
where a MscS homologue has not been identified. Identical residues in these sequences 
are enclosed by boxes. The pattern of highly conserved Ala and Gly between residues 
84 and 121 is indicated. (B) Hydropathy analysis of the E. coli MscS as calculated with 
the program TMHMM (Krogh et al., 2001). The narrowly spaced vertical lines represent 
the probability that a given residues is found in a membrane-spanning helix. The helical 
segments are predicted to include residues 22-44, 65-87, and 102-124, with the N and 
C termini positioned in the periplasm and cytoplasm, respectively. 
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interpretation has been questioned (Okada et al., 2002). MscM] was 
found to have a conductance of approximately 0.3 nS and was activated 
over a negative pressure range of 25-80 mmHg. In contrast, MscMJLR 
was found to have a much larger conductance of 2.0 nS, more similar to 
MscL than to Mach, but which opened at a lower and narrower negative 
pressure range similar to MscS. This indicates that MscMJ and MscMJLR 
are not strict functional homologues of MscL and Mach, but instead 
exhibit characteristics of both channels. 


IV. WHAT MAKES A MECHANOSENSITIVE CHANNEL MECHANOSENSITIVE? 


As evident from the functional and structural diversity of mechanosen- 
sitive channels (Fig. 1), there will be no unique answer to this question. 
Part of the challenge in characterizing mechanosensitive channels is 
that there have been no obvious clues in the protein sequences to in- 
dicate whether or not a particular protein may be mechanosensitive. 
In contrast to voltage gated potassium channels, which contain char- 
acteristic sequence motifs such as “GYG” in the selectivity filter or the 
pattern of positively charged residues in the S4 voltage sensor, compa- 
rable signature sequences have not yet been identified in mechanosen- 
sitive channels. Consequently, the mechanosensitivity of a channel can 
only be identified through functional characterization, at least for the 
first member of a family. In the case of prokaryotic channels such as 
MscL and MscS that are intrisincally mechanosensitive, however, basic 
features of this process are emerging that may reveal clues hidden in 
the amino acid sequences of these proteins relevant for mechanosen- 
sation. The substantial conductance of these channels requires a large 
pore diameter in the open state that in turn will require substantial con- 
formational rearrangements during the transition from the closed state. 
As envisioned in general models for mechanogated channels (Howard 
et al., 1988), the open state of MscL has an increased cross-sectional area 
relative to the closed state. Experimental studies have clearly established 
that the transition from the closed to the open state is associated with 
increased tilts for the membrane-spanning helix that require significant 
rearrangements in the helix—helix interfaces (Sukharev et al., 2001a; 
Perozo et al., 2002a). Consequently, an essential property of MscL must 
be the ability to stably adopt at least two distinct conformations with 
different helix packing interactions. 

As discussed earlier in this chapter, residues at the interfaces between 
adjacent TM1 helices of MscL are relatively well conserved. These inter- 
face residues tend to occur along two stripes that run along the i, i+ 4 
grooves on opposite sides of the helices (Fig. 5), which is consistent with 
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the packing of TM1 helices in a right-handed bundle with a crossing 
angle of ~ —40° (Chothia et al., 1981). Residues found occurring in 
these positions of TM] are generally small, specifically Gly and Ala, as is 
typical for the interface between membrane-spanning helices, and are 
relatively well conserved in MscL. In the model for the transition between 
the closed to open states developed by Sukharev and Guy (Sukharev etal., 
2001b), adjacent helices slide against one another through contacts me- 
diated by residues positioned along the i, i+ 4 direction. Consequently, 
residues at these positions in the TM1-TM1 interface must be compati- 
ble with helix packing interactions in both closed and open states. The 
use of small amino acids, particularly Gly and Ala, at these positions may 
facilitate the interconversion between conductance states. Examination 
of the MscL sequence alignments (Fig. 3A) shows some tendency for Gly 
and Ala to alternate at every second residue between positions 18 and 
28 to give the pattern AxGxxxGxAxG. The spacing is such that the Glys 
form an interface with one neighboring TM1 helix, while the Alas par- 
ticipate in the interface with the other neighboring TM]. Although the 
pattern GxxxG tends to be relatively common in membrane-spanning 
helices (Russ and Engelman, 2000), with the glycines positioned at the 
interface between helices, and although Ala is often commonly found in 
membrane-spanning helices, the combination GxAxG does not appear 
to be especially frequent (Senes et al., 2000), suggesting the presence of 
this motif in MscL may be functionally relevant. 

Intriguingly, the amino acid sequence of the third predicted trans- 
membrane helix of MscS exhibits similar patterns to those present in 
TM1 of MscL (Levina et al., 1999). In particular, there is a striking 
pattern of conserved Gly and Ala residues in the extended region be- 
tween residues 84-121 of the E. coli yggB gene product, with sequence 
AAxGxxGxxxAxxxAxxGAAGxAxGxAxxGxxxxxAAG. The tendency of 
Alas to appear every fourth residue, particularly between positions 94— 
113, is quite noticeable, while the pattern involving Gly is more vari- 
able. These residues are present in one of the most strongly conserved 
stretches of sequence in MscS, and the properties are suggestive of partic- 
ipation in helix-helix interactions that may be relevant to mechanosen- 
sitive channel function. 


V. CONCLUDING REMARKS 


The past few years have witnessed significant advances in establishing 
the gating mechanism of prokaryotic mechanosensitive channels, par- 
ticularly the recent characterization of the open state of MscL (Sukharev 
et al., 2001a; Perozo et al., 2002a) and the identification and initial 
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characterization of the protein responsible for MscS activity (Levina 
et al., 1999). Although biochemically more challenging, steady and 
increasing progress is being made in the identification and charac- 
terization of eukaryotic mechanosensitive channels (see Gillespie and 
Walker, 2001), including those that function through coupling to the 
cytoskeleton. Although an understanding in molecular detail of a com- 
plex mechosensory process such as hearing is still rather far off in the 
future, substantial progress has been made in deciphering basic features 
of mechanosensitive channels involved in simpler systems that will, it is 
hoped, reveal shortcuts towards this goal. 
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NOTE ADDED IN PROOF 


The crystal structure of the Escherichia coli MscS has been determined at 3.9 A resolu- 
tion [Bass, R. B., Strop, P., Barclay, M. and Rees, D. C. (2002). Science 298, 1582-1587.] 
and demonstrates that this channel folds as a membrane-spanning heptamer with a large, 
cytoplasmic region. Each subunit contains three transmembrane (TM) helices, with the 
TMB helices lining the pore, while TM1 and TM2 are likely candidates for the tension 
and voltage sensors. The transmembrane pore, apparently captured in an open state, 
leads into a large chamber within the cytoplasmic region, that connects to the cytoplasm 
through openings that may serve to filter out impermeant species. 
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I. INTRODUCTION 


Ion channels are specialized membrane proteins that reduce signifi- 
cantly the energy barrier ofion conduction across membranes. A region 
called the pore is where ion conduction occurs and the flux is pro- 
portional to the electrochemical gradient between the two sides of the 
membrane. The easiness of ion permeation or conductance of the chan- 
nel and the selection of the preferred types of ions that can go through 
the channel or selectivity are two properties of the pore region of the ion 
channel. The pore may be open in several different conductance states 
or altogether closed and the process that controls these conductance 
states is called gating. 

In this chapter we will not address ion conduction proper but we 
will concentrate on two aspects of ion channels: the gating process and 
the transduction from the initiating stimulus to the operation of the 
gate. The gating process is the operation of the gate proper while the 
transducer may be considered the sensor of the stimulus that ultimately 
opens and closes the channel. Specifically, we will address one type of 
sensor: the membrane potential or voltage sensor. 

Our understanding of voltage sensing and gating has been the result 
of a large number of physiological and biophysical observations that 
were initiated by the pioneering work of Hodgkin and Huxley (1952) in 
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a large variety of ion channels from squid axon sodium and potassium 
channels to potassium channels from bacteria. The correlation between 
structure and function has benefited especially from the possibility of 
expressing ion channels in heterologous expression systems that allows 
the study of specially engineered channels where specific sites can be 
tested with EPR or fluorescent probes. 

The description of structure-function relations of the voltage sensor 
will be based on experiments performed in the Shaker K channel, the 
first cloned potassium channel from Drosophila melanogaster, or the rat 
muscle sodium channel. At present, there is no three-dimensional struc- 
ture of voltage-dependent channels; therefore the experiments are a 
combination of site-directed mutagenesis, cysteine scanning, and prob- 
ing that, studied simultaneously with the function, helps in inferring the 
major conformational changes of the channel protein. Although some 
of these measurements can be very precise using optical techniques, 
most of these inferences are indirect and they allow only a hypothet- 
ical picture of channel operation that will be confirmed only when a 
high-resolution structure becomes available. 

The pore and its gates have also been studied using approaches sim- 
ilar to those outlined for the voltage sensor. However, the availability 
of the high-resolution crystal structures from prokaryotic Kt channels, 
KcsA and MthK (Doyle et al., 1998a; Jiang et al., 2002a,b), has provided a 
solid basis for understanding conformational changes with structural 
dynamic techniques such as EPR. The movements of the gates, de- 
scribed in the second part of this chapter, can then be considered as 
real conformational changes in the atomic structure of the channel 
pore. 


U. THE VOLTAGE SENSOR 


Voltage-dependent channels, such as the classical sodium or potassium 
channels in nerve tissue, change their conductance with membrane po- 
tential. The changes in conductance are a very steep function of mem- 
brane voltage: conductance values can increase as much as 150 times for 
an increment of 10 mV in membrane potential (Hodgkin and Huxley, 
1952). 

The ion current through the channel is the result of flow through an 
open pore under the influence of the electrochemical gradient. Thus the 
effective conductance of the channel depends on the open channel con- 
ductance and the fractional time the channel spends in the open state, 
also called open probability (P,). By recording the currents through 
single channels it is possible to measure the voltage dependence of the 
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conductance of the open pore, and it has been found to be weakly volt- 
age dependent. This means that the high voltage dependence of the 
conductance originates from the voltage dependence of the open prob- 
ability. As the gating of the channel controls the open probability our 
understanding of the voltage dependence will involve the description 
of the mechanisms that sense the membrane potential and couple the 
sensor to the operation of the gate. 

If voltage changes regulate gating the basic question is how voltage 
is detected by the channel. The two obvious candidates are electric or 
magnetic charges or dipoles that can reorient in the electric field. We will 
concentrate on electric charges because there is no evidence for their 
magnetic counterparts in channel proteins. The inescapable conclusion 
is that a change of position of charges or reorientation of dipoles will be 
a required mechanism to sense voltage changes (Hodgkin and Huxley, 
1952). 

Early evidence of the presence of the gating charge was obtained 
by recording the current produced by their displacement as a result 
of a change in membrane potential. These currents, which are tran- 
sient in nature, were called gating currents (see Fig. 1). The kinetics and 
steady-state properties of these currents gave information on channel 
states which were not directly measured by the ionic current (Fig. 1). 
This is because in the process of channel opening there are multi- 
ple states which are nonconducting preceding the conducting state(s) 
and the transition of the voltage sensor between these states moved 
charge, thus contributing to the gating currents. This is summarized 
in Fig. 1B. The gating charge Q, which corresponds to the time inte- 
gral of the gating currents, has a voltage dependence different from 
the open probability (P, in Fig. 1B) such that there is a region of volt- 
age where there is sizable movement of the charge with little or un- 
detectable open probability (Fig. 1B). This basic experimental obser- 
vation has drawn much attention and many kinetic models have been 
presented that can explain the basic features of Q and Py. However, 
the focus in this review will be on the structural basis of these observa- 
tions with the long-term goal of formulating physical models of channel 
operation. 


A. Charge Required to Open the Channel 


The large voltage dependence of these voltage dependent channels 
dictate that a large amount of charge move in the membrane electric 
field. Typically, at negative (inside) membrane potential, the channel is 
mostly closed and at positive potentials is mostly open. Then, the open 
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Fic. 1. (A) Recordings of ionic currents and gating currents of Shaker K channel 
with fast inactivation removed. The holding potential was —90 mV and the steps range 
from —120 to 0 mV in increments of 10 mV. The gating currents were recorded from 
a mutant that does not have pore conduction (W434¥F, Perozo et al., 1993). Notice that 
ionic currents are only visible with pulses that are more positive than —60 mV. (B) The 
voltage dependence of the integral of the gating current or gating charge (Q) and the 
open probability of the channel (P,). 
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probability P, is a sigmoid function of membrane potential that can be 
roughly approximated by a Boltzmann distribution. One can ask how 
much charge is necessary to account for the voltage dependence. This 
question can be answered by first defining the activation potential (W,) 
which is a measure of the electrical energy necessary to open the channel: 
Wa = —kT In Po. The mean activation charge (qa) is next defined as the 
negative gradient of W, or 


dln P, 
dV 


(Almers, 1978; Sigg and Bezanilla, 1997). Now we need to relate this 
mean activation charge with the gating charge displacement. We first 
find the distribution of open probability Po, knowing the energies F; of 
each one of the states 7 along the activation pathway where the reaction 
coordinate is the gating charge displacement q. Then, by assuming that 
the energy F; = G — q V, where G represents the energy that depends 
on thermodynamic variables other than voltage (V), one obtains the 
derivative of In P, with respect to V and thus obtains a relation between 
(qa) and q and V (Sigg and Bezanilla, 1997). If the channel has no charge 
movement between the open states, the total charge per channel is equal 
to 


(qa) = kT 








3 dln Po 
Qmax = lim kT 
V—>—00 dV 

The total gating charge Qmax can also be estimated by measuring the 
total gating charge Q and the total number of channels N and then 
dividing Q by N. However, in this case the estimation of Qmax may also 
contain contributions of charge that is not thermodynamically coupled 
to the opening of the channel. These two types of measurements have 
been done in the Shaker K channel, and remarkable agreement has 
been found between the two methods, indicating that there is very lit- 
tle or no peripheral charge (Fig. 2). It is important to notice that the 
value of Qmax corresponds to the charge that traverses the whole elec- 
tric field of the membrane. If, for example, the movement is only half 
of the total electric field then the actual charge must be double to ac- 
count for the same amount of Qmax. We should then consider Q max 
as the total electric charge times the fraction of the field it traverses. 


B. The Residues That Make Up the Gating Charge 


In Shaker the value of Qmax is about 13 e per channel (Schoppa et al., 
1992). In the skeletal muscle Na channel the value of Qmax was 12 e 
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Fic. 2. Top: The neutralization of some residues in the S2, S3, and S4 segments de- 
crease the gating charge. Each bar represents the charge measured after neutralization 
by the limiting slope method (empty bars) or the Q/N method (filled bars). ShB-IR rep- 
resents the wild-type channel. Data from Seoh et al. (1996) and Aggarwal and MacKinnon 
(1996). Bottom: Topology of the transmembrane segments of Shaker K channel. Four 
subunits are required for channel function. The highlighted residues have been proved 
to be part of the gating charge. In sodium and calcium channels the four subunits are 
strung together in one polypeptide with four domains that are homologous among them 
but not identical to each other. 


as estimated with limiting slope using single-channel measurements 
(Hirschberg et al., 1996). What structure within the protein can ac- 
count for such large values of charge displacement? The simple tilt- 
ing of alpha helices may not be enough because each helix of the 
length of the membrane thickness is equivalent to a dipole separating 
0.5 e, and as there are a total of 24 transmembrane segments they all 
would have to make a substantial conformational change. It is more 
likely that either acidic residues or basic residues bearing a permanent 
charge may be oriented and translocated by the change in the electric 
field. 

When the first voltage-dependent ion channel was cloned and se- 
quenced, it was recognized that the fourth transmembrane segment 
had a string of basic amino acids organized as one every three residues 
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was separated by hydrophobic residues. This pattern was found in 
voltage-dependent sodium, potassium, and calcium channels. In ad- 
dition, the second and third transmembrane segments had conserved 
acidic residues. These charges are shown in the lower half of Fig. 2 that 
depicts the transmembrane segments of the Shaker potassium channel. 
Four subunits are necessary to assemble a functional channel. In the 
case of the sodium and calcium channels a long polypeptide codes for 
the whole pore-forming subunit, but it has four repeats or domains that 
are similar to the four subunits of the potassium channel. The proof that 
these basic and acidic amino acids were responsible for the gating charge 
was obtained when the neutralization of a particular residue was shown 
to decrease the total charge per channel (Aggarwal and MacKinnon, 
1996; Seoh et al., 1996). Thus, the first four most extracellular basic 
residues of the S4 segment and the most intracellular acidic residue of 
the S2 segment of Shaker were found to contribute to the gating currents 
(see Fig. 2). If these five charges move across the whole electric field, 
the total charge per channel would be more than satisfied. The simplest 
interpretation is that the charges traverse a fraction of the electric field 
giving a total contribution of about 13 e for the total movement. 


C. Movement of Charged Residues with Membrane Voltage 


From the structural point of view, the movement of the voltage sen- 
sor is quite challenging because it needs to transport a large amount of 
charge across the membrane field. The first problem is the stability of 
the S4 segment with its basic residues, presumably all positively charged 
and embedded in the protein matrix. Initial hypotheses were that these 
basic residues were forming salt bridges with acidic residues, thus lower- 
ing the conformational energy to reasonable values. In this hypothesis, 
the change in the electric field would break these salt bridges allowing 
the basic residues to translocate to the other side of the membrane, pro- 
ducing the charge movement that could be recorded as gating current. 

Papazian et al. (1995) found that in fact the basic residue K374 (the 
fifth charge of the S4 segment in Shaker) forms a network of charges 
with acidic residues E293 from S2 and D316 from S3. Disruption of 
this network prevents normal protein folding. It is important to remem- 
ber that K374 does not contribute much charge to the gating currents. 
In addition, Tiwari-Woodruff et al. (2000) found another network that 
operates between R368 and R371 (the third and fourth charge of the 
S4 segment) and the acidic residue E283 of the S2 segment. However, 
there is no evidence that R362 and R365 (first and second charges of the 
S4 segments) pair with any acidic group. 


218 BEZANILLA AND PEROZO 


The translocation of the basic residues of the S4 segment in response 
to voltage changes has been a matter of intense study. The techniques 
used have all been indirect because no three-dimensional structure in- 
formation is available. We could classify the techniques into three broad 
categories. The first is the use of reactive agents against an engineered 
residue in the S4 segment; the second is the use of site-directed fluores- 
cent probes to test conformational changes (Mannuzzu et al., 1996; Cha 
and Bezanilla, 1997, 1998; Loots and Isacoff, 1998); and the third is the 
use of fluorescence resonance energy transfer (FRET) to measure ac- 
tual distances and changes in distances between fluorophores inserted 
in the structure of the voltage sensor. The three types of techniques 
have given complementary information that has started to delineate a 
possible mechanism of operation of the voltage sensor. 


1. Testing the Exposure of Residues with Probes 


The use of cysteine reactive probes allows the testing of exposure of 
residues replaced by cysteine from either the inside or the outside ofthe 
membrane and as a function of the state of the sensor. The technique 
requires a mutation of the residue under study to a cysteine. Then, by 
applying a cysteine modifying reagent that on reaction changes the elec- 
trophysiological properties it is possible to test whether the reaction is 
occurring from the inside or the outside by using nonpermeant mod- 
ifying reagents. The test is done in the hyperpolarized or depolarized 
membrane, thus probing the accessibility of the residue to the reagent 
mainly in two different conformations of the channel molecule. These 
experiments have been carried out in the S4 segment of the fourth do- 
main of the skeletal muscle sodium channel and in the S4 segment of 
Shaker Kchannel (Larsson et al. 1996; Baker et al., 1998; Yang and Horn, 
1995; Yusaf et al., 1996). The results show that indeed several charged 
residues of the S4 segment could be exposed to the inside when the 
membrane is hyperpolarized or to the outside when it is depolarized. 
However, some residues may be exposed to one side in one state and 
buried in the other state, indicating that they do not change sides when 
the membrane potential is changed. 

The other method consists of a replacement of the basic residues by 
histidine (Starace et al., 1997; Starace and Bezanilla, 2001). The ratio- 
nale is that histidine can be titrated by pH and the pk, is close to the 
physiological pH allowing a range of Ht concentrations that can span 
from full loading of the histidine—thus bearing a positive charge—to 
full unloading making it a neutral residue. Histidine scanning has sev- 
eral advantages over cysteine scanning when dealing with the charges 
of the sensor. First, it is possible to conserve or eliminate the charge 
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of the basic residue under study, and second, the probe is a proton, 
which is much smaller than cysteine reagents, allowing the probing of 
exposure to restricted aqueous crevices. These experiments have been 
carried outin anonconducting mutant of Shaker (mutation in the pore 
region, W434F; Perozo et al., 1993) because it requires recording gating 
currents to detect the proton accessibility under a proton gradient across 
the membrane. 

If the histidine residue alternates exposure sides in response to mem- 
brane potential changes, one can observe an asymmetry in the gating 
current between the On and Off response. This is due to the fact that his- 
tidine is positively charged in the low pH solution but is uncharged in the 
high pH solution, introducing an asymmetry in the transported charge 
depending on the direction of movement of the sensor. In addition, if 
the membrane potential is in the range where the P, is near 0.5, the 
sensor makes frequent transitions between the two extreme positions; 
therefore there will be a net current because each excursion of the voltage 
sensor from the low pH side to the high pH side will transport a proton, 
but none will be transported in the opposite direction. This current has a 
bell-shaped voltage dependence because it tends to decrease toward the 
extreme potentials at which the voltage sensor makes infrequent excur- 
sions between the two states. This voltage sensor-driven proton current 
was observed for the second, third, and fourth charges (R365H, R368H, 
R371H) of the S4 segment of Shaker showing that these three residues 
can be alternately exposed to either side depending on the state of the 
channel (see Fig. 3). 

For the first charge (R362H), the observation was that there is a proton 
current that is not bell-shaped but increases monotonically as the mem- 
brane is hyperpolarized. This proton current has all the characteristics 


362H |) GQ] Accessible from BOTH sides (H* channel) 

365H |) GQ) Accessible from BOTH sides (H+ transport) 

368H EL Accessible from BOTH sides (H+ transport) 

371H | QQ) Accessible from BOTH sides (H+ transport) 
(H* channel at positive potentials) 

374H EC) NOT titratable 

3778 Ba) NOT titratable 


Fic. 3. Summary of the results of histidine scanning in Shaker K channel. When 
replaced by histidine, the first four basic residues (R362 through R371) translocate com- 
pletely from intracellular to extracellular while the next two residues (K374 and R377) 
are not titratable. 


220 BEZANILLA AND PEROZO 


of an ionic current through a channel, as opposed to the other case 
that represents a typical transporter. The interpretation of this result is 
that in the hyperpolarized condition the histidine in position 362 acts 
like a bridge between the inside and outside allowing protons to circu- 
late according to their electrochemical gradient using the histidine as 
a selectivity filter. The implication of this interpretation is that in the 
hyperpolarized state the electric field is concentrated in an extremely 
narrow region in the vicinity of residue 362 (Starace and Bezanilla, 1999). 
When the fourth charge is replaced by histidine (R371H) it forms a pro- 
ton transporter; however at large depolarizations it seems to also forma 
proton channel as is the case for R362H (Starace and Bezanilla, 2001). 
This may be interpreted as the formation of a narrow constricted region 
with the consequent concentration of the electric field near R371 at high 
depolarizations, delineating certain symmetry in the movement of the 
voltage sensor. 

When replaced by histidine, the fifth and sixth charges (K374H and 
R377H) are not titratable from either side of the membrane at any mem- 
brane potential. This may imply that these residues are always inacces- 
sible or they do not move in the electric field. As the neutralization 
studies indicate that K374 does not contribute much charge, most likely 
this residue does not move in the field. As residue R377 could not be 
studied in the neutralized case because the protein did not express, we 
do not know whether it contributes to the gating charge. However, as it is 
even further along the sequence than any of the other charges and when 
replaced by histidine is not titratable we might consider it not moving 
in the electric field (see Fig. 3). 

The results of cysteine and histidine scanning show that the gating 
charge residues reside in a solvent-accessible region of the protein. This 
is in contrast to the idea that the charges would be buried in the protein 
and protected by salt bridges. It is interesting to compare the results 
of cysteine scanning and histidine scanning because the probe differs 
drastically in size. Thus, positions that are not accessible with the cys- 
teine reagent are accessible with protons, and this normally happens for 
positions that are far from the side where the test is made at potentials 
that take the residue away from the testing side. This result implies that 
the charges reside in water-filled crevices that may narrow down signif- 
icantly away from the protein surface. At hyperpolarized potentials the 
internal crevice will contain at least the first four charges, but the first 
charge will be in the narrowest part that will be extremely close to the 
external face. When the first charge is replaced by a histidine it makes a 
bridge to the outside, but in the normal situation, arginine has a longer 
chain and volume; therefore is unlikely that it will span as a bridge. 
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In addition, the pK, of argininine is so high that it would rarely lose 
its charge. 

These experiments have given a first glimpse of how the charges may 
reside in the protein and in relation to the bulk solution, and also show 
us that the S4 is clearly optimized as a voltage sensor. We must remember 
that the charge movement is the product of the actual charge times the 
fraction of the field it moves. Then, to minimize large movements and 
consequently large conformational changes, the field should be concen- 
trated in a small distance as seems to be the case for the narrow crevices. 
The other aspect, already mentioned, that the charges normally reside in 
the aqueous medium, has a stabilizing effect on the hyperpolarized and 
depolarized conformations. It is then clear that to change the conforma- 
tion from closed to open or vice versa it will be necessary to overcome a 
large energy barrier as the charge is translocated through the hydropho- 
bic part of the protein. Once the barrier is overcome, the charges will 
jump across in a rapid motion giving origin to the gating shots that 
have been inferred from noise analysis of the gating currents (Conti 
and Stuhmer, 1989; Sigg et al., 1994). The waiting time to overcome this 
barrier is proportional to the time constant of the gating current and 
is shortened by a favoring electric field that by subtracting to the total 
energy of the barrier, effectively lowers it, facilitating the jump. 


2. The Electric Field in the Channel Protein 


The discussion of the previous paragraph emphasizes the importance 
of knowing the potential profile in the protein structure to account 
for the charge movement and its voltage dependence. Some progress 
has been made in measuring the local electric field using fluorescent 
probes (Asamoah et al., 2002). There is a class of fluorophores that 
contain an electrochromic group with an intrinsic dipole moment that 
can be modified by the absorption of a photon. This differential dipole 
moment couples to the local electric field and perturbs the electronic 
energy levels between the ground and excited state. Consequently, the 
electrochromic effect is manifested as a spectral shift. By measuring 
the fluorescence with a filter that spans the region of maximum shift 
of the spectrum, it is possible to detect the shifts as changes in the fluo- 
rescence intensity. These electrochromic probes have been used exten- 
sively to measure membrane potential because they partition in the lipid 
bilayer, and their spectral shift is proportional to the electric field. 

One of these probes was modified so as to have a cysteine reactive 
site, and at the same time it was made less hydrophobic to decrease 
its partition in the lipid bilayer with the idea of inserting the probe in 
predetermined sites of a channel. For this purpose the Shaker K channel 
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was engineered to have cysteines in specific sites and after staining with 
dye the fluorescence was measured as a function of imposed voltage 
and time. In most cases, the time course of the fluorescence change 
was found to have two components. The fast component followed the 
charging ofthe membrane capacitance and the slow component varied 
widely depending on the position ofthe dye, voltage, and pulse duration. 
The fast component was interpreted as the electrochromic response 
of the dye, while the slow component was interpreted as a change in 
environment around the probe produced by a conformational change of 
the channel protein. Starting from a fixed potential, the electrochromic 
component had a linear dependence on the voltage of the pulse, and 
on return to the original potential, the electrochromic signal reversed 
its direction and frequently had a different magnitude. This means that 
the probe responded very quickly to the imposed potential and that the 
subsequent conformational changes in the channel made the response 
different, presumably because the field or the environment around the 
probe was modified by the voltage. 

The slope of the fluorescence change as a function of membrane po- 
tential was measured in several sites of the Shaker K channel, and it 
was found that the strength of the electric field was maximum in the 
S4 segment. The magnitude of these changes were calibrated in terms 
of the electric field using the probe that partitions in the bilayer and 
comparing the AF /F of this probe for a change in V with the equiva- 
lent value from each one of the signals recorded from the different sites 
in the channel. In the S4 the strength of the field was found to be 3 times 
the field strength in the bilayer. This means that most of the field is con- 
centrated in a thickness of about 10 A. This result is in agreement with 
the conclusion drawn from histidine scanning where the first charge 
replaced by histidine generates a proton pore (Starace and Bezanilla, 
1999). This requires a very concentrated field where the selectivity fil- 
ter of the proton corresponds to the histidine group bridging the gap 
between the inside and outside solutions. 


3. Direct Measurements of Distance Changes during Activation 


Two groups have used FRET to estimate distance changes as a result 
of changing the membrane potential (Cha et al., 1999; Glauner et al., 
1999). In FRET a donor fluorescent molecule can transfer energy to 
another probe (most generally another fluorophore) through dipole- 
dipole interactions. The efficiency of energy transfer depends on several 
known or computable factors such as the spectral overlap of the probes, 
the quantum yield of the donor, the index of refraction, and a much less 
known factor: the «? or relative orientation of the dipoles of the donor 
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and acceptor probes. It is generally assumed that the fluorophores are 
randomly oriented giving a x? = 2/3, allowing the computation of the 
distance at 50% energy transfer efficiency or R,. Knowing Ro, the dis- 
tance between the fluorophores can be computed with extreme preci- 
sion based on the energy transfer and it has a power dependence of 6 
with respect to distance. This makes the selection of the fluorophore pair 
quite critical to optimize the high precision obtained when the measured 
distance is close to the R, value. 

Distance measurements have been done on the Shaker K channel 
taking advantage that it is a homotetramer. In this way, by labeling one 
subunit the channel acquires four labels. By adjusting the ratios of the 
donor and acceptor fluorophores, it is possible to label channels with 
an excess of donors with respect to acceptors and estimate distances 
between subunits. The logic of these experiments is that if there is 
movement of the voltage sensor (S4 and its neighborhood) with re- 
spect to the center of the channel then it should be possible to detect 
a distance change between subunits when the membrane potential is 
changed. 

FRET measurements were done using the transfer of energy between 
fluorescein and tetramethylrhodamine and it was found that several sites 
in the S3-S4 linker changed positions during activation (Glauner et al., 
1999). The estimation of the energy transfer was done by recording 
the change in the photobleaching time constant when the acceptor was 
present. This required measurement on several oocytes to allow for dif- 
ferent membrane potentials and the presence and absence of acceptor. 
Their results were interpreted as a rotation with a possible transmem- 
brane translation of the S4 segment. 

Cha et al. (1999) used a variant of FRET called LRET for lanthanide- 
based fluorescence energy transfer. In this technique (Selvin, 1996) the 
donor is terbium or europium which, in fact, is luminescent. There are 
several advantages of this technique over regular FRET. It has been found 
that terbium emits isotropically, which means that the uncertainty due to 
the dipole orientation is decreased to a maximum error of +10%. This 
error can be decreased even further if the anisotropy of the acceptor is 
also known. The second advantage is that the fluorescence decay has a 
time constant of about 1.5 ms, making it easily measurable with conven- 
tional recording techniques. The third advantage is that the emission of 
terbium is peaked and one can find fluorophores that emit in between 
peaks. This means that the fluorescence of the acceptor can be measured 
with little or no contamination from the donor. In addition, as the ac- 
ceptor has a fast decay, any measurement of the acceptor fluorescence 
with decays comparable to the donor will exclude any possible direct 
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excitation of the acceptor. For these reasons, the measurement of the 
decay of the acceptor fluorescence, also called sensitized emission, is 
the cleanest way to detect transfer. The efficiency of transfer is com- 
puted from the ratio of the donor decay without acceptor (tp) and the 
sensitized emission decay (Tap), and the distance R is computed from 
Forster theory (Cantor and Schimmel, 1980) with 


ih 1/6 
nn 
Tp — TDA 


Selvin (1996) has designed a cage that holds the terbium and excludes 
most of the water that normally acts as a quencher. The chelate has a car- 
bostyril group that is excited at 334 nm by a nitrogen laser and transfer 
energy to the terbium that in turn emits. This improves enormously the 
absorption of light by terbium that has a very low extinction coefficient. 
The chelate has a maleimide group that is used to react with cysteine 
to attach the terbium chelate to the engineered site in the channel. 
The experiments of Cha et al. (1999) used this Tb chelate as donor and 
fluorescein as acceptor. Oocytes expressing Shaker K channel with the 
engineered cysteine site were stained with a mixture of Tb chelate and 
fluorescein in a 3:1 molar ratio. This ensured that most of the channels 
that contained donor and acceptor had three donors per each acceptor 
(see Fig. 4A). The channels containing only donors were invisible when 





= 2 2 
Rsa= 2 Rsc (Rt 2d) =2 (Rg¢t2d) 

Fic. 4. (A) Measured distances using LRET in the Shaker K channel. Tb represents 
the attached terbium chelate and F represents the attached fluorescein maleimide. The 
distances between contiguous subunits (Rsc) and the distances across the pore (Rsa) are 
related by the Pythagorean theorem as shown by the equation at bottom, left. (B) When 
the fluorophores have rotational freedom with radius d, Rsa and Rsc are related by the 
equation and bottom, right that differs from the equation in A. 
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measuring sensitized emission, and only a very small fraction contained 
more than one acceptor per donor, a situation that must be avoided 
because it complicates the computation of the distance. If the accep- 
tor is receiving from three donors, as it was in these experiments, each 
donor is independent because the decay time of fluorescein is orders of 
magnitude faster than the decay of Tb. Under these conditions, there 
are two possible distances that could be measured: contiguous subunits 
or across-the-pore subunits. Therefore the sensitized emission should 
show two exponential components. In fact this was verified experimen- 
tally, and the two computed distances were found related to one another 
as predicted by the Pythagorean theorem (Fig. 4A). It is interesting to 
note that if the fluorophores are not centered in the corners of the tri- 
angle, but have a large rotational freedom with a radius d, then the two 
time constants do not convert to distances related by the Pythagorean 
theorem as shown in the diagram of Fig. 4B. As most of the distances 
measured were within 1 to 2 A of the values predicted by the Pythagorean 
theorem (Cha et al., 1999), the fluorophores did not have a large rota- 
tional freedom. 

Several sites were measured and some showed changes in distance 
with voltage. The best characterized residue was position 346 in the S3- 
S4 loop that showed as much as 3.2 A change in distance with voltage, 
and the voltage dependence of the change was the same as the voltage 
dependence of the charge movement. Another set of three consecutive 
sites in the same loop but closer to the N terminus of the S4 segment 
were found to change distances with voltage. Thus residue 351 increased 
distance with depolarization, 352 did not change, and 353 decreased. 
This pattern can be best explained by a rotation of an alpha helical 
segment where the residue 352 maintains the same distance to the pore 
(center of symmetry) regardless of the membrane potential. In addition, 
all the residues tested near S4 and S4 proper (363) indicate a change in 
the tilt of S4 with change in voltage. 

The results of Glauner et al. (1999) and Cha et al. (1999) indicate 
a rotation of the S4 segment, but the experiments did not address di- 
rectly a possible translation of the segment perpendicular to the plane of 
the membrane. The measurement of this possible movement has been 
addressed by Starace et al. (2002). The approach was to measure en- 
ergy transfer efficiency between eGFP groups genetically encoded in the 
inside of Shaker to MTS-sulforhodamine attached to cysteine residues 
located on the outside loops. To calculate the energy transfer, the fluo- 
rescence of the donor (eGFP) was measured in the presence of the ac- 
ceptor (Fpa) and then the acceptor was cleaved off by DTT allowing 
the measurement of the fluorescence of donor only (Fp). The distance 
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Rwas then calculated as 


n 1/6 
j = (u) 
Fp/Fpa — 1 


The distances from the intracellular eGFP to the extracellular cysteines 
in the S1-S2 and S3-S4 loops were large but they showed very little 
voltage dependence. Some increased with depolarization and others 
decreased, but the changes were never larger than 2 Ä. These results do 
not support a large translation of S4 perpendicular to the membrane. 


4. Role of the S3-S4 Linker 


If the S4 segment moves significantly on depolarization, the linker 
between S3 and S4 may become an important constraint in this move- 
ment if it is shortened. The case of Shaker that has a total of 31 amino 
acids has been investigated in detail by Gonzalez et al. (2000, 2001) by 
studying the effect of its shortening. The linker can be reduced to seven 
amino acids by splicing out the N-terminal part of the linker without no- 
ticeable effects in the ionic currents. A further shortening of the linker 
produces an effect on the speed of channel opening and stability of the 
closed state that is periodic with the number of amino acids left in the 
linker. This periodic behavior was studied in detail and it was found to 
be equal to 3.6 amino acids, indicating the presence of an alpha helix. It 
is important to note that the channel with only three amino acids in the 
linker behaves like the wild-type Shaker, indicating that the movement 
of S4 cannot be too large unless the S3 segment is also carried by the 
movement. 

Gonzalez et al. (2001) developed a simple model that can explain the 
periodic behavior of the energy difference in opening the channel and 
the kinetics with the length of the linker. In this model, the maximum 
movement of the N-terminal part ofS4 is only 3 A based on the results of 
Cha et al. (1999), and the C-terminal part of S3 is not movable. Then, on 
depolarization, the movement of S4 will occur easily if the bonds of the 
residues in the linker can change their angles. However, certain linker 
lengths will require breaking one or two hydrogen bonds from the ex- 
tracellular portion of S4 to allow for the movement, thus explaining the 
slower kinetics and stabilization of the closed state. In some cases, the 
closed or resting state has already required some bonds to be broken 
to unwind enough S4 to achieve the correct distance between S3 and 
S4. In some of the cases the unwinding is also enough to reach in the 
depolarizing case, making no difference in kinetics or closed-state stabil- 
ity. These cases will look like the wild type as it is when the linker is only 


VOLTAGE SENSOR AND GATE IN ION CHANNELS 227 


three amino acids long. Interestingly, when the linker is eliminated com- 
pletely the gating charge is reduced. This charge reduction is predicted 
by this model because the unwinding of the S4 is such that the first basic 
residue (R362) becomes part of the linker and does not participate in 
gating charge any longer. 


D. A Model of the Voltage Sensor 


We have seen that there are many types of experiments that show con- 
formational changes during voltage sensing. The challenge is to build a 
plausible model of the operation of the voltage sensor that will account 
for all these observations. There are some other experiments that will 
impose constraints in the model such as the electrostatic interactions 
(Papazian et al., 1995) and the amino acid scanning (Monks et al., 1999; 
Li-Smerin et al., 2000; Li-Smerin and Swartz, 2001). 

Roux (2002) has used the experimental evidence available in the 
Shaker K channel to find the most probable three-dimensional arrange- 
ment of segments S1 through S4 around the KcsA pore structure. This 
procedure seems like a reasonable approach because it may be consid- 
ered as an unbiased summary of all the available information that can be 
used as a starting point to propose the conformational changes during 
voltage sensing. 

In the proposed model we will consider first that the gating charges 
of the S4 segment seem to be in contact with the aqueous environ- 
ment connected to the intracellular or extracellular side, depending 
on the voltage (see Section II,C,1). A simple way to account for this is 
to hypothesize water crevices penetrating the core of the protein. These 
crevices may be stabilized by the presence of the charged groups of 
arginine and lysine and may disappear when those groups move away 
from the crevices. The second observation to consider is the small dis- 
tance changes measured that do not seem to exceed about 3 A. This 
small movement requires that the electric field be concentrated in 
the regions where the charges move (see Section II,C,2). The model 
presented in Cha et al. (1999) and expanded in Bezanilla (2000) fits 
within these requirements. However, when one considers the results 
obtained by Roux (2002) the model must be modified to account for 
the most probable distribution of membrane segments in the chan- 
nel protein. This modified model is shown in Fig. 4. In this model 
many of the constraints have been considered including the electro- 
static interactions among the S4, S3, and S2 segments (Papazian et al., 
1995; Tiwari-Woodruff et al., 2000) as described at the beginning of 
Section II,C. 
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Roux used distance measurements from FRET and tethered blockers 
(Blaustein et al., 2000) and found that the 45 Ä measured with LRET 
(Section U,C,3) for position 363 (Cha et al., 1999) imposes the restric- 
tion that the S4 segment must be located between subunits (Fig. 5, see 
color insert). It is interesting that a cavity connected to the intracellular 
solution is generated between S5 of one subunit and S6 of the contigu- 
ous subunit where one face of the S4 segment is located. Therefore if 
that face contains the first four charges they would be in an aqueous 
environment and connected to the intracellular side. This would be the 
closed state of the sensor. To move the charge across the field when 
depolarizing the membrane, the possibilities are a large translation of 
the segment toward the outside or a rotation with a change in tilt of the 
S4 segment to expose the charges to another crevice connected to the 
outside. Most likely the movement may involve a combination of both 
but with a rather restricted translation to account for the results of FRET 
(Section II,C,3) and short S3-S4 linker (Section I,C,4). In the model 
depicted in Fig. 5 only a simple rotation has been implemented just to 
illustrate that it is possible to translocate most of the charges with a small 
movement. A reliable model of voltage sensor movement is still in the fu- 
ture because the few measurements available are not consistent with one 
another and many more measurements are needed to precisely locate 
the charge positions in the closed and open configurations. The recently 
cloned sodium channel from Bacillus halodurans (Ren et al., 2001) opens 
the possibility of precise measurements that may eventually give us a de- 
tailed picture of the conformational changes that occur during voltage 
sensing. Once these conformations are known it will be possible to fi- 
nally link the kinetic models that have been developed in extreme detail 
with a physical model of charge movement. 


Ill. THE CHANNEL GATE 


A. KcsA as a Model of Ionic Permeation and Gating 


The Streptomyces lividans Kt channel (KcsA) is a 160-residue protein 
that forms homotetrameric channels closely related to the pore domain 
of larger voltage-dependent channels (Schrempf et al., 1995). When pu- 
rified and reconstituted in lipid bilayers, KcsA catalyzes single-channel 
activities with selectivity properties identical to those of other eukary- 
otic Kt channels (Cuello et al., 1998; Heginbotham et al., 1999; Meuser 
et ol. 1999). The fact that KcsA is easily expressed in Escherichia coli at 
milligram levels made this protein an ideal target for structural analysis. 
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Crystallographic studies led to the high-resolution structure of KcsA by 
Doyle et al. (1998b), an achievement that lent a firm structural founda- 
tion to more than three decades of functional work on Kt channels. The 
crystal structure revealed that KcsA is formed by the association of four 
subunits, contributing equally to form a water-filled pore. Each subunit 
has two transmembrane segments, TM] in the periphery of the com- 
plex and TM2 lining the permeation path. Toward the extracellular face 
of the channel is the selectivity filter, where intimate contact with the 
permeant ions takes place. 


B. Site-Directed Spin Label Analysis of KcsA 


With the idea of obtaining structural dynamic information of the two 
transmembrane segments, an extensive site-directed spin label (SDSL) 
study of KcsA in its native environment was undertaken (Perozo et al., 
1998). Sixty-six single cysteine mutants were prepared and analyzed for 
residues 22-52 and residues 86-120 of KcsA, where the two putative 
transmembrane segments are thought to be located. Using discrete 
Fourier transform methods, angular frequency information was ex- 
tracted from residue environmental profiles from each of the scanned 
protein segments and used in the assignment of secondary structure 
elements (Perozo et al., 1998). 

Collision of nitroxides with fast-relaxing radicals, such as oxygen and 
metal ion complexes, causes spin exchange that effectively shortens the 
spin-lattice relaxation time T of the nitroxide (Hyde and Subczynski, 
1989). This effect can be measured either by continuous wave (CW) 
power saturation techniques or by saturation recovery methods. Colli- 
sion frequency is directly proportional to the accessibility of the para- 
magnetic reagent to the nitroxide radical and is defined as 


_ APi(X) AM (DPPH) 
Depp) AH, 





where Pı,e(DPPH) is the microwave power that saturates the signal rel- 
ative to that without saturation for a 2,2-diphenyl-1-picrylhydrazyl crys- 
tal, AP /9(X) = [P1j2(X) — Pij2(Ne)], the difference in Dua of a given 
sample exposed to a fast-relaxing paramagnetic species (such as Os or 
NiEdda) and Na (Altenbach et al., 1990). AH, is an estimate of probe 
mobility (Mchaourab et al., 1996; Columbus et al., 2001). This II parame- 
ter can be used to deduce secondary and tertiary structure information, 
which is derived from the periodic behavior of a series of sequential 
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mutants in the presence of relaxing agents. Solvent accessibility infor- 
mation derived from either Os or NiEdda collision frequencies was in 
agreement with the crystal structure, suggesting that TM1 has a large 
lipid-exposed surface, while TM2 is more protected from membrane 
lipid. An excellent example of the use of solvent accessibilities to de- 
scribe unknown structures is found in the recent SDSL analysis of KcsA 
cytoplasmic domains (Cortes et al., 2001). Because of the requirements 
specified by the successful crystallization conditions, no structural infor- 
mation exists on these regions of KcsA (Doyle et al., 1998). Thus, regions 
in the N and C termini of KcsA predicted to be soluble and exposed to 
the cytoplasm were probed by site-directed spin labeling. The results can 
be viewed in the form of a residue environmental phase diagram by plot- 
ting Ni accessibility (IINiEdda) versus Os accessibility (IIOs, Fig. 6, see 
color insert). The structural phases can be classified as aqueous (high 
IINiEdda and low Og), lipid exposed (high TO% and low IINiEdda), 
and buried (both low IINiEdda and low IIOs). From these diagrams it is 
clear that the N-terminal region of KcsA resides at the membrane inter- 
face (straddling both membrane-exposed and water-exposed regions, 
Fig. 6A, top), while the C terminus is wholly exposed to the aqueous 
environment (Fig. 6A, bottom). Using this data set, together with the 
known coordinates from the crystal structure, it was possible to develop 
a backbone three-dimensional model of full-length KcsA (Fig. 6B) to 
complete the structural description of KcsA with the membrane as a 
reference point and to provide a structural basis for understanding its 
mechanism of activation gating. 

Spin-spin interactions have played a key role in the structural analy- 
sis of KcsA and in the molecular description of its activation mechanism 
(see below). Due to the tetrameric stoichiometry of KcsA, spin-spin cou- 
pling can originate either from interactions between next neighbors or 
from diagonally related subunits (Fig. 7A, see color insert; only the di- 
agonal subunits are shown). Thus, in an ideally labeled KcsA, four spin 
labels would be contributing to the spectral broadening (Fig. 7B). Under 
these conditions, a qualitative estimate of the magnitude of spin-spin 
interactions can be obtained from the ratio of amplitudes of the central 
resonance line (M = 0) between the underlabeled and fully labeled mu- 
tant, both normalized to the total number of spins in the sample (Perozo 
etal., 1998). This interaction parameter, Q2, was used to survey patterns of 
spin coupling along the length of TM2 and deduce possible interhelical 
orientations. Strong broadening was in fact observed every three or four 
residues on one face of the C-terminal half of TM2 (in residues A108, 
T112, V115, and G116), as shown from the comparison between their 
spectra at underlabeling conditions (Fig. 7B, left) and those obtained 
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from fully labeled channels (Fig. 7B, center). In agreement with the 
relative intersubunit angle between TM2 segments, significantly weaker 
spin-spin coupling (as measured from the Q parameter) was detected 
on the N-terminal half of the helix (Fig. 7C). In a parallel SDSL analy- 
sis of KcsA, Gross et al. (1999) have shown essentially the same type of 
results in TM2, both for solvent accessibility and for the distribution of 
spin-spin coupled residues. 


C. Helical Movement during KcsA Gating 


The structure of KcsA can be used as a remarkable blueprint to explain 
the principles of ion permeation (Doyle et al., 1998). The gating mech- 
anism, however, has been harder to deduce from direct examination of 
the KcsA structure. The key breakthrough that gave a handle on the 
spectroscopic analysis of KcsA gating was the discovery that KcsA can be 
opened at low pH (Cuello et al., 1998). This was done using, bulk $°Rb+ 
flux assays and at the single-channel level in bilayer systems (Fig. 8), and 
thus established that the KcsA crystal structure most likely represented 
a closed conformation. Although initially the proton site was thought to 
be extracellular (Cuello et al., 1998), more recent studies have clearly 
demonstrated that channel opening is induced by intracellular changes 
in pH (Heginbotham et al., 1999; Meuser et al., 1999). 

Earlier results from voltage-dependent channels suggested that chan- 
nel opening was associated with global conformational changes in the 
intracellular face of the channel (Armstrong, 1971; Liu et al., 1997). The 
first direct demonstration of the types of movements that may be associ- 
ated with channel opening came from Perozo et al. (1998) as large pH- 
dependent changes in intersubunit spin-spin coupling were observed at 
selected residues along the C-terminal end of TM2 (Figs. 7B, 7C). From 
the crystal structure, these residues were shown to be located at the nar- 
rowest section of the internal helix bundle, close to the fourfold axis of 
symmetry. In each case, spectra obtained at low pH showed a reduction 
in the magnitude of the intersubunit dipolar coupling (Fig. 7C), leading 
to the suggestion that channel opening was a consequence of a widening 
in the inner vestibule of the channel. 

A more through analysis of these movements was subsequently car- 
ried out by studying changes in mobility and dipolar coupling along the 
main structural elements in the channel: the two transmembrane seg- 
ments TM1 and TM2, as well as the regions flanking the selectivity filter 
(Perozo et al., 1999). The changes in spectral line shape were analyzed 
using discrete Fourier transform methods to detect possible patterns that 
could indicate specific types of movements in each of the TM segments 
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Fic. 8. Proton activation of KcsA. (A) Time course of ®#Rb+ uptake depends on ex- 
travesicular pH. Open circles, uptake at pH 7.0 (left) or 3.5 (right). Filled circles, uptake 
in the presence of 10 mM Bal in both sides of the membrane. (B) pH effect on single- 
channel properties of KcsA in planar lipid bilayers. Top: Consecutive single-channel traces 
obtained at pH 4.0, 4.5, 5.0 and 6.0 for a multichannel asolectin bilayer. Bottom panels: 
All-point histogram corresponding to each set of single-channel records. 


originating from pH acidification. The majority of the residues along 
TM2 showed a reduction in the extent of spin-spin coupling as mea- 
sured by the Q parameter, indicating a translational movement or he- 
lical tilt away form the symmetry axis that would indicate an increase 
in the diameter of the internal vestibule of the channel. Vector analysis 
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of these changes in the context of a helical wheel projection indicated 
that the pattern of change in intersubunit proximity could be explained, 
in addition to the helical tilt, by a rotation of the individual TM2 helices 
in an anticlockwise direction (Perozo et al., 1999). 

Similar analysis of the spectral changes in the “external” transmem- 
brane segment TM1 also revealed helix rotations and tilts compatible 
with a rigid-body movement of the entire helix. However, because TM1 
is located further away from the axis of symmetry, changes in intersubunit 
residue proximity are more subtle and harder to detect than in TM2. For 
this reason, and given that less than half of TM] residues are involved in 
any extensive tertiary contacts, changes in probe mobility (AA Æ) area 
very sensitive indicator of helix tilting and rotation. As in TM2, these data 
suggested an anticlockwise rotation of TM1 on channel gating. Limited 
data on the types of conformational changes occurring in the vicinity of 
the selectivity filter showed subtle changes in both probe mobility and 
spin-spin interaction (Perozo et al., 1999). Interestingly, spin labels lo- 
cated at the C-terminal end of the pore helix reported larger changes 
in spectral line shape than did those in the extracellular vestibule of the 
channel. Taken together, these spectral changes might be suggestive of 
possible conformational rearrangements that could affect the energetics 
of ion coordination at the selectivity filter. 

As stated above, distance determinations in KcsA (and in other 
oligomeric proteins in general) are complicated by the geometric re- 
lationship between subunits, as a fully spin-labeled channel will exhibit 
dipolar interactions between both neighboring and diagonally related 
subunits. This problem has been solved by the use of tandem dimer 
constructs to determine changes in intersubunit distances at the in- 
ner helical bundle of KcsA (Liu et al., 2001). Using tandem dimers 
a series of constructs were generated so that pairs of spin labels, lo- 
cated along the critical TM2 bundle crossing, were used to calculate 
distance changes in the channel gate between the closed and open 
conformations. 

Two nitroxides, separated by the distance r, are coupled through space 
via electron-electron dipolar interactions arising from the unpaired 
electrons. Spin coupling induces line splitting dependent on their sepa- 
ration and their orientation with respect to the magnetic field according 
to 


2B = (3/2) geß(3 cos 0 — 1) /r° 
where 0 is the angle between the interspin vector and the magnetic 


field, ge the electron g value, and £ the Bohr magneton. In solution, @ is 
distributed isotropically, and thus line splitting is observed instead as an 
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overall spectral broadening. The extent ofthe broadening is directly cor- 
related to the interspin separation, which can be extracted by Fourier 
deconvolution of the broadened and unbroadened (noninteracting) 
EPR spectra with a ~2 A accuracy within the 8-25 A range (Rabenstein 
and Shin, 1995). These intersubunit distances were then used to derive 
a backbone structure for the KcsA inner helical bundle in the open state 
with the aid of a novel computational algorithm. This computational 
approach (restraint-driven Cartesian transformation, ReDCaT) is based 
on an exhaustive sampling of rigid-body movement in Cartesian space 
and was used to determine the type, direction, and magnitude of the 
conformational changes in TM2 using the limited distance information 
(Sompornpisut et al., 2001). The overall picture of KcsA gating emerging 
from these studies is summarized in Figure 9 (see color insert). Assuming 
complete rigidity of the helical TM segments, the gating model proposes 
that on lowering the pH, TM1 and TM2 undergo an anticlockwise rota- 
tion and a simultaneous movement away from the symmetry axis of the 
tetramer, opening the intracellular side of the permeation path (Fig. 9A). 
Quite recently, however, the crystal structure of another prokaryotic 
K* channel, MthK from Methanobacterium thermoautotrophicum (a 
Ca?*-activated channel), has been obtained in its open conformation 
(Jiang et al., 2002b). With a >12 A wide opening, this remarkable struc- 
ture revealed larger overall movements than those modeled in KcsA 
(Fig. 9B), and this correlates well with the differences in maximal P, be- 
tween KcsA (low maximal P,) and MthK (high E. max). Nonetheless, the 
type and direction of the helical movements at the level of the gate seen 
in the MthK structure are in complete agreement with those derived 
from EPR measurements (Fig. 9A). A key finding in the new open struc- 
ture is the fact that in MthK, gating may originate as a consequence of a 
glycine hinge in the middle of the helix that helps bend the C-terminal 
half of TM2 away from the permeation path. 


D. The Role of the Selectivity Filter in the Gating Mechanism 


Once significant pH-dependent movements in TM2 had been demon- 
strated, an important point that needed to be addressed was, how do 
these rearrangements fit within the overall gating mechanism of KcsA? 
That is, are the conformational changes in TM2 necessary and sufficient 
to open the channel? Given the structure of full-length KcsA, we thought 
of two additional regions that could participate in KcsA gating. On one 
hand, since the C terminus forms a helical bundle aligned to the fourfold 
symmetry axis, it could restrict ion permeation by reversibly plugging 
the entrance to the internal vestibule. This possibility was examined 
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in a series of deletion experiments (Perozo et al., 1999; Cortes et al., 
2001), which demonstrated that although complete truncation of the 
C terminus affects both pH dependence and the stability of the closed 
state, the channel was able to fully close by moving to a slightly more 
alkaline pH. Therefore, the C terminus cannot be the activation gate of 
the channel. 

The second region under consideration was the selectivity filter. Coor- 
dination of KI ions at the selectivity filter establishes strict geometric 
constraints in order to optimally select against impermeable ions while 
allowing for fast Kt ion translocation. Therefore, small perturbations in 
selectivity filter geometry can have significant consequences for its ability 
to conduct ions. This fact makes the selectivity filter an energetically eco- 
nomic gate. Indeed, a number of independent pieces of evidence seem 
to point to the selectivity filter as a region with a great deal of influence 
over the gating behavior of a channel. There are well-documented exam- 
ples of permeant ion effects on gating (Swenson and Armstrong, 1981; 
Spruce et al., 1989; Shapiro and Decoursey, 1991; Demo and Yellen, 1992; 
Clay, 1996; Mienville and Clay, 1996; Chen etal., 1997; Pusch et al., 2000). 
Ions with long occupancy times (Rbt, Ost, NH4*) tend to stabilize the 
open state through a “foot in the door” effect of the gate, even though 
the only region of channel-ion interaction occurs at the selectivity filter. 
Additionally, channels seem to populate subconducting states on the way 
to the open state (Chapman et al., 1997; Zheng and Sigworth, 1997), and 
these subconducting states show altered selectivities. More recently, un- 
natural amino acid mutagenesis targeted to the signature sequence of 
an inward rectifier Kt channel had dramatic effects on the rapid gating 
transitions of an inward rectifier channel (Lu etal., 2001), again pointing 
to the selectivity filter as a contributor to the gating process. 

Small conformational changes were detected in a few residues near 
the selectivity filter when the gating mechanism was under examina- 
tion (Perozo et al., 1999), but the number of residues was too limited 
and the magnitude of the conformational changes too small to allow 
these changes to be interpreted in terms of specific protein motions. 
Recently, the overall conformational rearrangements associated with 
pH-dependent gating within the P-loop were analyzed in a more com- 
plete set of residues: residues 64-74 in the pore helix and residues 81-84 
in the outer vestibule of the channel (Cuello and Perozo, 2002). EPR 
spectra obtained at neutral and acidic pH revealed changes in spin la- 
bel mobility on each side of the signature sequence. Although subtle, 
these rearrangements were remarkably reproducible and were always 
observed under the same conditions that promoted major movements 
of the TM2 segments. 
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Although it was clear that there were reproducible changes around 
the selectivity filter on gating, additional evidence was needed to actu- 
ally link the P-loop to the gating transitions in KcsA. It has been known 
for many years that some permeant ions can have profound effects on 
the gating behavior of a variety of Kt channels. These effects seem to 
be correlated with the degree of occupancy of the ion in the selectiv- 
ity filter (Demo and Yellen, 1992). Given that in Kt channels intimate 
ion-protein interactions only take place at the selectivity filter (Doyle 
et al., 1998b), the role of the P-loop in activation gating and its influ- 
ence on other moving parts of KcsA were probed using different per- 
meant ions (Cuello and Perozo, 2002). With Kt as a charge carrier, 
the single-channel behavior of KcsA is characterized by fast gating tran- 
sitions (mean open times 1-2 ms) clustered in bursts of activity sepa- 
rated by long silent periods. However, when Rb* is the charge carrier, 
open times were increased more than 10-fold (main Topen = 165 ms) 
with a sharp reduction in single-channel conductance (from ~100 to 
~8 pS), yet the closed times remained broadly unchanged. Open prob- 
abilities increased from the standard P, = 0.05 — 0.1 in symmetric Kt 
to P, ~ 0.5 in symmetric Rb*. The next question was how these gating 
effects, induced by different permeant ions, are reflected in the struc- 
tural rearrangements measured by EPR. Control experiments in high 
[Kt] revealed only residual conformational changes in both the P loop 
and the intracellular gate (Fig. 10, see color insert). However, when 
the experiment was repeated in high [Rb*] (Fig. 10, right), significant 
changes in line shape were observed both in the P loop and in TM2. 
Thus, by promoting a small structural rearrangement in the selectivity 
filter with a higher affinity permeant ion (Rbt), it was possible to detect 
significant conformational rearrangements in TM2, at the opposite end of 
the molecule. These results suggest that the internal gate and the P-loop 
are coupled both structurally and functionally, lending support to the 
notion that the selectivity filter does participate in gating during normal 
channel operation. 

Additional evidence linking the selectivity filter to the gating ma- 
chinery was obtained from single-channel analysis of site-directed mu- 
tants. Reasoning that mutations that stabilize channel-ion interactions 
will produce a gating behavior similar to Rb* ions, Cuello and Perozo 
(2002) searched for mutants that alter KcsA gating properties at the 
single-channel level. In the P-loop, they studied mutations targeted to 
the pore helix (T72C, A73C, T74C) and the external vestibule (Y82C). 
These mutations produced no major changes in the ability of KcsA to 
sense proton concentrations, as determined from macroscopic ME 
uptake experiments. However, cysteine substitutions in these pore helix 
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Fic. 11. Gating as a consequence of the structural interplay between two “gates”. 
(A) The present data are interpreted according to a minimal four-state sequential model 
in which movement of the channel intracellular gates (TM2) allows the selectivity filter 
to fluctuate and enter a “conductive” conformation that corresponds to the open state. 
In transit from the closed state the channel must populate an intermediate open, non- 
conductive state. As these states are conformationally coupled, the model predicts that 
the intracellular gate is unlikely to close before the selectivity filter returns to its “non- 
conductive” state. (B). The model suggests that the duration of a burst is governed by 
the C <> Cg equilibrium, occurring in the intracellular gate, while the intraburst activity 
(channel flicker) is determined by the Cg < O transition. 


residues had substantial effects on the kinetics of KcsA. This particu- 
larly applies at position 72 where the mean open time increases from 
~2 ms in wt-KcsA to ~55 ms in T72C. Open times were also lengthened 
for mutants A73C and T74C, while Y82C, the only mutation tested in 
the outer vestibule, showed no significant kinetic differences compared 
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to wt-KcsA. It must be noted that these mutations increase mean open 
times even under symmetric K*, producing an overall stabilization ofthe 
open state in a manner similar to that described above for Rb* above. 
High-affinity permeant ions such as Rbt and Cst and specific muta- 
tions in the pore helix seem to be influencing gating through a “foot 
in the door” effect, as suggested a long time ago by Swenson and Arm- 
strong (1981). This phenomenon was thought to be analogous to the 
actions of large quaternary ammonium ions in preventing the closure 
of the internal gate of many Kt channels (Armstrong, 1971). However, 
although they are mechanistically similar, permeant ions and pore mu- 
tants can only exert their effect by affecting the conformation of the 
selectivity filter. Thus, all of the measured functional and structural ef- 
fects observed in the internal gate might originate as a consequence of 
the conformational coupling between these two structures. In this in- 
terplay, it has been suggested (Cuello and Perozo, 2002) that opening 
of the intracellular gate (TM2) defines the length of a given single- 
channel burst and sterically allows conformational flexibility in the se- 
lectivity filter. The profound influence of permeant ions on KcsA open 
time distributions and its consequent effects on conformational changes 
indicate that the P-loop may be responsible for defining the fast gat- 
ing properties of the channel (flicker) and thus, the mean open times 
(Fig. 11). 
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I. INTRODUCTION TO RHODOPSIN AND VISUAL SIGNAL TRANSDUCTION 


G-protein coupled receptors (GPCRs) are a large superfamily of in- 
tegral membrane proteins that serve as input stages to molecular sig- 
nal transduction systems, the outputs of which modulate various cel- 
lular responses. The input signals, which may be molecular ligands or 
physical stimuli, are highly diverse, and particular GPCRs have evolved 
to interact with specific signals. Rhodopsin, the photoreceptor of the 
vertebrate retina, defines the largest subfamily of GPCRs (Gether, 2000). 


243 Copyright 2003, Elsevier Science (USA). 
ADVANCES IN All rights reserved. 
PROTEIN CHEMISTRY, Vol. 63 0065-3233/03 $35.00 


244 HUBBELL ET AL. 


rhodopsin 









C. 
disk membrane 


y extracellular (intradiscal) surface 


cytoplasmic surface 


electrogenic 
ion pump 


B ™1 "Mä TM3 TM4 TMS TM6 TM7 H8 

































































op extracellular surface 


RHODOPSIN STRUCTURE, DYNAMICS, AND ACTIVATION 245 


The receptor is localized in highly organized intracellular organelles, 
the disk membranes, in the rod photoreceptor cell (Fig. 1A). Physiolog- 
ically, the effects of photon absorption by rhodopsin are closure of ionic 
channels in the plasma membrane and a concomitant change in the volt- 
age across the plasma membrane of the cell. The light-to-voltage trans- 
duction process is highly amplified and essentially thermally noiseless 
(Baylor, 1987; Burns and Baylor, 2001). For example, a single absorbed 
photon in arod cell triggers an electrical response with a probability of at 
least 0.5, and during the response the flow of 710” ions is blocked from 
passing through ion channels in the plasma membrane corresponding 
to a change of ~3% in the transmembrane potential. Despite an ex- 
tremely high density of rhodopsin, 10° per cell, the dark noise of the 
photoreceptor cell is the equivalent of only one photon per 100 s. The 
subject of this review is the molecular mechanism whereby rhodopsin 
initiates and shapes this remarkable process. 

Rhodopsin holds a special place in the GPCR superfamily. It was the 
first GPCR to be sequenced (Ovchinnikov, 1982; Hargrave et al., 1983), 
it has consistently led the way as a paradigm for other family members, 
and it was the first whose structure has been determined at the atomic 
level (Palczewski et al., 2000). The general structural motif of rhodopsin, 
and all GPCRs, is a seven-transmembrane (TM) helix bundle (Fig. 1B) 
that was evident in early work from sequence and biochemical analysis 





Fic. 1. (A) A schematic representation of the rod photoreceptor cell and the intra- 
cellular disk membrane organelles (for more detail, see Fein and Szuts, 1982). The inset 
represents a magnified view of a portion of a disk, showing the location of the rhodopsin 
molecule and the very small space separating the two membranes within a disk (the ex- 
tracellular space). An electrogenic ATP-driven ion pump generates an extracellular Nat 
current (the “dark” current) and a transmembrane potential. The current returns to the 
cytoplasm via c-GMP-dependent ion channels in the plasma membrane. Light absorp- 
tion by rhodopsin triggers closure of the channels, thereby hyperpolarizing the plasma 
membrane. (B) The secondary structure of rhodopsin. The transmembrane helices are 
TMI-TM7T, and H8 isa short interfacial helix. The segments connecting the TMs at the cy- 
toplasmic surface of the receptor are numbered C1-C3, and those at the extracellular sur- 
face E1-E3. The structure of C3 is based on site-directed spin labeling (see Section III,E). 
Kinks in the TM helices are indicated as breaks, and the kink angles indicated (Teller et al. 
2001). The horizontal lines indicate the approximate location of the membrane/aqueous 
interface (see Section IV). Residue K296 and E113, identified with shaded square sym- 
bols, are the site of the protonated Schiff base attachment of the 11-cis retinal chro- 
mophore (not shown) and its counterion, respectively. Diamond symbols mark residues 
that are highly conserved throughout the rhodopsin subfamily of GPCRs. Sequences at 
the cytoplasmic surface directly involved in transducin interaction are shaded. These 
same sequences are also implicated in binding of the regulatory proteins arrestin 
and rhodopsin kinase. Zigzag lines at 322C and 323C represent palmitoyl residues, 
while phosphorylation sites in the C-terminal tail are marked by filled square symbols. 
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(Hargrave, 2001), later strongly supported by cryoelectron microscopy 
(Unger et al., 1997), and finally verified by the recent crystal structure. 
The biochemistry that mediates photon absorption in the disk mem- 
branes and closing of the ionic channels in the plasma membrane is 
summarized in Fig. 2 and discussed in detail in reviews that provide ac- 
cess to the original literature (Helmreich and Hofmann, 1996; Menon 
et al., 2001; Hamm 1998, 2001; Bunemann and Hosey, 1999; Krupnick 
and Benovic, 1998). In the dark (inactive) state, rhodopsin (R) con- 
tains a covalently bound 11-cis retinal chromophore buried within the 
protein core. In rhodopsin, the chromophore absorbs maximally around 
500 nm, and absorption of a photon isomerizes the retinal to the all-trans 
form within about 200 fs (Peteanu et al., 1993) with a quantum effi- 
ciency of 0.67 (Dartnall, 1972). The change in chromophore shape cre- 
ates strain, and subsequent thermal relaxation of both chromophore 
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and protein through a series of spectrally defined intermediates cul- 
minates in a conformational switch of rhodopsin to metarhodopsin II 
(MH), the active signaling species (R*). The signal is passed from R* 
to the heterotrimeric G protein, transducin TG: (GDP)], by a direct 
binding interaction. The R*-transducin interaction triggers a conforma- 
tional switch in the a-subunit of transducin, causing dissociation of a 
bound GDP to form an empty complex [Gwwgy(0)]. Binding of GTP 
to the empty complex triggers yet another conformational switch in 
the a-subunit resulting in its dissociation from the complex in an ac- 
tive state [G,*(GTP)]. In succeeding steps in the cascade, Gy* (GTP) 
activates a phosphodiesterase (PDE), thereby reducing the cytoplasmic 
level of c-GMP which closes a c-GMP regulated ion channel in the plasma 
membrane (Fig. 1A). 

In this cascade, there are three stages of molecular amplification that 
account for the high gain of the rod cell response: (1) each R* acti- 
vates multiple transducin molecules; (2) each activated PDE hydrolyzes 
many molecules of c-GMP, and (3) many channels respond to the drop 
in c-GMP. The high-gain amplifier is shut down at the rhodopsin level 
through phosphorylation of R* (in the C-terminal domain) by rhodopsin 
kinase, and subsequent binding of arrestin to the phosphorylated seg- 
ments (Fig. 2B). Thus, the relatively small exposed cytoplasmic surface 
of rhodopsin (+1500 A?) must display recognition and interaction sites 
for transducin, arrestin and rhodopsin kinase each of which is of higher 
molecular weight than rhodopsin itself. 

Each of the amplification stages is a potential source of the dark noise 
mentioned above (1 activation/100 s). The spontaneous activation of 
rhodopsin accounts for about 50% of the noise, and it can be estimated 
that a thermal activation event occurs on the average once every 2000 
years (Burns and Baylor, 2001). This extremely low thermal activation 
rate is due to the covalently linked chromophore that provides extreme 
stability to the “off” state of the protein. 

In a general sense, the transduction/amplification system and shut- 
down mechanisms outlined in Fig. 2 are relays of molecular switches, 
each triggered in succession. The input signal (light) triggers the rho- 
dopsin switch, and the signal is passed along the relay by either protein- 
protein or small molecule-protein interactions that trigger the next 
element in the relay. To understand the operation of the system, it is 
then necessary to elucidate the molecular switch in each of the ele- 
ments, and to map out the protein-protein interaction surfaces. At the 
present time, crystal structures are known for G„(GDP) (Lambright, 
1994), G„(GTPyS) (Noel et al., 1993), Gap, (GDP) (Lambright, 1996), 
rhodopsin (Palczewski et al., 2000; Teller et al., 2001), and arrestin 
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(Hirsch et al., 1999). Because these are static images, they cannot by 
themselves reveal the mechanisms of molecular switching. However, they 
do form an essential foundation for the design of molecular genetic and 
spectroscopic experiments that will ultimately elucidate the dynamic 
events of molecular switching, as well as identify the protein-protein 
interactions. For example, site-directed mutagenesis and peptide com- 
petition studies have already identified key sequences in rhodopsin that 
are involved in transducin interaction and activation (Franke et al., 1992; 
Konig et al., 1989; Marin et al., 2000; Yang et al., 1996a). As shown in 
Figure 1B, these regions are confined to the cytoplasmic links G1-C3 
and the H8 helix. 

Over the past decade, cysteine scanning mutagenesis and associated 
techniques of site-directed spin labeling (SDSL), sulfhydryl reactivity, 
and disulfide cross-linking kinetics have been systematically employed to 
assign structural features to rhodopsin sequences throughout the entire 
cytoplasmic surface, and to monitor changes in the protein conforma- 
tion in this critical region upon light activation. This comprehensive 
approach was made possible by the total synthesis of the rhodopsin 
gene (Ferretti et al., 1988) and the development of expression systems 
and purification methods that provided mutant proteins in sufficient 
quantities and in a high state of purity (Oprian et al., 1987; Reeves 
et al., 1999). The strategies employed in this seminal work have been 
reviewed (Khorana, 2000). 

The results of these studies revealed key features of the structure and 
the dynamic nature of the cytoplasmic surface of rhodopsin in solution. 
Most important, they provided the first direct structural evidence for he- 
lix movements underlying receptor activation in a GPCR (Farahbakhsh 
etal., 1993; Altenbach etal., 1996; Farrens et al., 1996), and more recently 
a complete description of the spontaneous structural changes that occur 
at the cytoplasmic surface on formation of the activated state (Altenbach 
etal., 1999a,b, 2001b,c; Cai et al., 1999a,b, 2001; Klein-Seetharaman et al., 
1999, 2001; Langen et al., 1999). This review will be focused on a col- 
lective interpretation of data from SDSL, sulfhydryl reactivity, disulfide 
cross-linking, and the recent crystal structure to provide a view of the 
resting and active state of rhodopsin at the cytoplasmic surface and to 
identify at least part of the rhodopsin switch. 

In Sections II and III, the crystal structure of rhodopsin is briefly re- 
viewed and compared with the dynamic structure in micellar solutions 
and membranes as inferred from the biophysical methods mentioned 
above. A structural model of the cytoplasmic surface derived from solu- 
tion NMR of peptides has been presented (Yeagle et al., 1997; Katagadda 
et al., 2001), but this approach does not provide direct information on 
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functional dynamics in the intact molecule or conformational changes 
leading to the activated state and will not be described in this article. 
Section IV examines the supramembrane organization ofthe disk mem- 
brane as interpreted from the recent rhodopsin crystal structure. In 
Section V, experimental evidence bearing on the rhodopsin conforma- 
tional switch is reviewed. The final section, Section VI, reviews structure— 
function relationships deduced from a collective consideration of the 
crystal structure, the dynamics of the solution structure, and the confor- 
mational changes underlying activation. 


II. THE RHODOPSIN CRYSTAL STRUCTURE: THE INACTIVE STATE 


A structural model of rhodopsin in the dark (inactive) state derived 
from electron density maps refined at 2.8 A is shown in Fig. 3 (see color 
insert) (Teller et al., 2001). As is customary, the structure is discussed with 
respect to three topological (as opposed to folding) domains: transmem- 
brane, cytoplasmic, and extracellular (intradiscal). The transmembrane 
helical segments (i.e., the segments of the helices within the bilayer inte- 
rior) are referred to as “TM1-TM7,” and the single helix lying parallel 
to the bilayer surface is designated “H8.” The sequences corresponding 
to these segments are given in the legend to Fig. 3, and the basis of the 
assignments is discussed below in Section IV. The segments that link the 
TM helices on the cytoplasmic and extracellular surfaces will be referred 
to as CI-C3 and E1-E3, respectively. Finally, the sequence beyond the 
palmitoylation sites from 324 to 348 is designated the C-terminal tail. 

In the crystals used to obtain the model of Fig. 3, there are two 
molecules in the asymmetric unit, molecules A and B. Molecules A and B 
differ in the degree of order within the crystal, as reflected in the plot of 
thermal factors in Fig. 4A. Gaps in the plot of Fig. 4A reflect portions of 
the chain that could not be modeled because of the extent of disorder. 
Molecule B has more extensive gaps and has generally more disorder 
throughout. The model of Fig. 3 corresponds to the most highly ordered 
Molecule A, and unless stated otherwise, all models of rhodopsin in this 
review will be based on this structure. The A structure has been analyzed 
in detail in several recent publications with respect to helix tilts, kinks, 
and interactions stabilizing the inactive state (Teller et al., 2001; Menon 
etal., 2001; Okada et al., 2001), the role of specific residues in “tuning” the 
absorption wavelength of the chromophore (Teller et al., 2001; Menon 
et al., 2001), the relationship to inherited visual diseases (Menon et al., 
2001; Okada et al., 2001), and comparison with bacteriorhodopsin (bR) 
(Teller et al., 2001). Some key points related to rhodopsin activation will 
be summarized below. 
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Fic. 4. (A) Plots of the thermal (B) factors as a function of sequence position for 
molecules A and B of the asymmetric unit (pdb entry IHZX). The shaded and un- 
shaded vertical bars mark the TM and interconnecting segments, respectively. (B) Aligned 
molecules A and B showing the “plastic” regions at the cytoplasmic surface where they 
differ. At the extracellular surface and in the TM sequences, they are well aligned. Dark 
and light ribbons mark the A and B chains, respectively. 
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The helices of the rhodopsin transmembrane domain are distin- 
guished from those of bR by the large number of irregularities and 
kinks, due primarily to Pro and Gly residues (Figs. 1B and 3). These 
kinks provide potential points of flexibility in the otherwise rigid helical 
rods. One minor kink, at Ser-127 in TM3, may be due to H bonding of 
the side chain -OH with the backbone carbonyl of Ile-123, and as such 
would not constitute a point of flexibility. Of particular interest is Pro-267 
in TM6. This residue is completely conserved throughout the family of 
rhodopsin-like receptors and is likely to be a fundamental element of 
the activation switch (Baldwin et al., 1997; see Section V,C). 

Many of the residues conserved throughout the rhodopsin receptor 
family are located in transmembrane helical segments near the cyto- 
plasmic surface, or their extensions into the cytoplasmic space (Baldwin 
et al., 1997; Gether, 2000). This is anticipated, because the cytoplasmic 
surface in the different receptors is designed for interaction with pro- 
teins of a common signal transduction cascade, whereas the extracel- 
lular surface is specialized for interaction with particular ligands. The 
highly conserved residues (>90%) are identified in Fig. 3 by spheres 
at the a-carbon atoms. One group is clustered at the interaction sur- 
faces of TM1 (N55), TM2 (L79, D83), and TM7 (N302, P303, Y306, the 
NPXXY motif) and are involved in a network of interhelical interac- 
tions that stabilize this part of the helix bundle. Another group is at the 
cytoplasmic termination of TM3 (E134, R135, V139) (Palczewski et al., 
2000). The D134/R135 pair is salt-bridged or H-bonded (Honig and 
Hubbell, 1984) and buried within the hydrophobic environment of the 
helix bundle. While R135 may make interactions with TM6 and stabilize 
its position in the inactive receptor (Teller et al., 2001), itis also clear that 
the E134/R135 pair is directly involved with functional coupling to the 
G protein (Franke et al., 1990). Conserved Y223 in TM5 is located near 
the contact surface with TM6, but faces the lipid bilayer at the level of 
the polar head groups (see below). Tryptophan-161 in TM4 is hydrogen 
bonded to Asp-78 in TM2, a residue that is itself conserved as either 
Asp or Ser throughout the subfamily (88%). These residues face the 
hydrophobic interior of the lipid bilayer, an environment which favors 
strong H bonding, and this interaction may serve to fix the relative po- 
sitions of TM2 and TM4 that sequester TM3 within the bundle interior 
at this level. 

The thermal factors reveal that the extracellular domain, consisting of 
the N-terminal sequence and the extracellular links El-E3, is the most 
highly ordered region of the structure in the crystal (Fig. 4). Together 
with the extracellular ends of the TM helices, the extracellular domain 
forms the binding pocket for the 1 1-cis retinal chromophore, shown as a 
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space-filling model in Fig. 3. The chromophore is covalently bound to the 
protein through a protonated Schiff base linkage at K296 in TM7, with a 
counterion at E113 in TM3 (Sakmar et al., 1989; Zhukovsky and Oprian, 
1989; Nathans, 1990). The covalent bond between the chromophore 
and protein and the K296/E113 interhelical salt bridge, located in the 
hydrophobic interior of the protein, play crucial roles in determining 
both the wavelength of maximum absorbance and the extreme stability 
of the resting (“off”) state of the receptor (Menon et al., 2001). The 
latter point is appreciated from the astonishingly low rate of spontaneous 
activation of rhodopsin, one event every 2000 years, as estimated from the 
thermal noise of the photoreceptor (Burns and Baylor, 2001). Moreover, 
rupture of the K296/E113 salt bridge is apparently a requirement for 
reaching the active state of rhodopsin (Rao and Oprian, 1996). 

One wall of the chromophore binding pocket is formed by beta strands 
£3 and £4 of extracellular loop E2. A highly conserved disulfide bond be- 
tween Cys-110 and Cys-187 serves to firmly anchor ß4 to the C-terminal 
end of TM3. If the chromophore were to enter and exit the binding 
pocket from the extracellular, surface, this rigidly held 63/84 wall would 
be required to move. Other salient features of the extracellular surface 
include the carbohydrate residues linked to Asn-2 and Asn-15 (S1 and 82, 
Fig. 3) and the fingerlike loop projecting away from the bundle. The gly- 
cosylation at Asn-15 appears to have a structural role, because mutations 
at Asn-15 produce rhodopsins defective in G-protein activation (Kaushal 
et al., 1994), and deglycosylation at this site produces structural changes 
at the cytoplasmic surface of rhodopsin as detected by SDSL (Zhang, 
C., Ridge, K, Khorana, H. G. and Hubbell, W. L., unpublished results). 
As will be discussed further below, the projecting loop, defined as the 
sequence between Pro-12 and Pro-27, may serve to stabilize the unique 
structure of the disk membrane. 

The cytoplasmic surface of rhodopsin is of particular interest because 
C2, C3, and H8 form the recognition /interaction surface for the G pro- 
tein (Fig. 2B). Thus, the activating conformational switch must be ul- 
timately expressed as structural changes at the cytoplasmic surface. As 
can be seen in Fig. 4A, the interhelical links C2 and C3 are the most 
disordered and hence uncertain regions in the structure (aside from 
the C-terminal tail). The maximum B factor in each of the interconnect- 
ing links increases with length, and the B factors in the transmembrane 
helices rise sharply as the cytoplasmic links are approached. As noted 
above, a number of residues in C3 could not be modeled at all, and in 
the B molecule this is also the case in C2. This suggests plasticity in C2 
and C3, and perhaps enhanced dynamics in solution. The axis of helix 
H8 lies approximately perpendicular to that of the TM helices at the 
cytoplasmic surface. The helix itself is amphipathic with the nonpolar 
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surface exposed, suggesting that this surface is solvated by the lipid bi- 
layer in the native environment. Arginine-314 is ideally positioned to 
interact with negatively charged lipids in this model. 

The C-terminal tail of rhodopsin extends from residue 324 to residue 
348 and is relatively poorly determined in the crystal structure (thermal 
factors >50 in the A chain, and notresolved in the B chain). From 324 to 
330, the C-terminal tail curls back over H8. Residues 331-333, near the 
C-terminus of TM7, could not be modeled. From 334 to 338 the chain 
passes over the end of TM2, turns, and finally extends in the direction 
of TM4 but away from the body of the helix bundle. 

In the following section, the average structure of the cytoplasmic do- 
main in solution, as seen by SDSL, sulfhydryl reactivity, and disulfide 
cross-linking kinetics, is compared with the crystal structure. In addi- 
tion, the dynamics of C1-C3 and H8 are evaluated with SDSL. 


III. STRUCTURE AND DYNAMICS OF RHODOPSIN IN SOLUTIONS OF DODECYL 
MALTOSIDE: THE CYTOPLASMIC SURFACE IN THE INACTIVE STATE 


Cysteine-scanning mutagenesis, involving more than 100 mutations, 
has been systematically carried out through C1-C3, the cytoplasmic ter- 
minations of TM1-TM7, H8, and the C-terminal tail. In addition, more 
than 40 pairs of cysteines have been introduced at the cytoplasmic face. 
With these mutants as a basis set, three classes of experiments have 
been carried out, namely SDSL, sulfhydryl reactivity, and disulfide cross- 
linking kinetics. A global comparison of the results provides a unique 
view of the solution state, its dynamics, and its correlation with the crystal 
structure. By solution state is meant, in all cases, rhodopsin solubilized 
in dodecyl maltoside (DM) micelles. The measured functional proper- 
ties of rhodopsin, namely transducin activation (Resek et al., 1993) and 
phosphorylation by the rhodopsin kinase (Thurmond et al., 1997), are 
conserved in this detergent, and it is presumed to be a reasonable ap- 
proximation to the bilayer environment. 

Before discussing the results from these methods, a brief review of 
the measured quantities and interpretations for each, as applied to 
rhodopsin, is presented. 


A. Overview of Methods for Exploring the Solution Structure of Rhodopsin: 
Site-Direcied Spin Labeling, Sulfhydryl Reactivity, and Disulfide Cross-Linking 
1. Site-Directed Spin Labeling 


The developing methodology and expanding applications of SDSL 
have been chronicled in a series of reviews, and these provide access 
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Fic. 5. (A) Reaction of (1-oxy1-2,2,5,5-tetramethylpyrroline-3-methyl) methanethio- 
sulfonate to generate the nitroxide side chain R1. (B) Simulated first-derivative EPR 
spectra for a nitroxide showing the characteristic changes in line shape that accompany 
changes in correlation time for isotropic motion. These are conveniently categorized in 
terms of the central linewidth(6). 


to the original literature in the field (Hubbell and Altenbach, 1994; 
Hubbell et al., 1996, 1998, 2000; Feix and Klug, 1998). In the method, a 
substituted cysteine residue is reacted with a selective nitroxide reagent 
to generate a nitroxide side chain, the most common of which is desig- 
nated R1 (Fig. 5A). The electron paramagnetic resonance (EPR) spec- 
trum of the RI side chain can be analyzed in terms of three quantities 
relevant to examining the solution structure. These are (1) the mobility 
of the side chain; (2) the accessibility of the nitroxide to paramagnetic 
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reagents free in solution; and (3) the distance between the nitroxide and 
another paramagnetic center in the protein, either another nitroxide or 
a bound metal ion. 

The accessibility to paramagnetic reagents is measured by the exper- 
imental “accessibility parameter” II, a quantity directly proportional to 
the collision frequency of the nitroxide with the reagent. As is intuitively 
reasonable, IT is directly proportional to the exposure of R1 on the sur- 
face of the protein. The commonly used paramagnetic reagents include 
Os and the metal ion complex NiEDDA. Because oxygen has sufficient 
solubility in both membrane interiors and water, I1(Og) is useful to ex- 
plore the exposure of R1 on both hydrophobic and hydrophilic surfaces 
of a membrane protein. 

It has been shown that Il(Os) for Rl is in fact proportional to the frac- 
tional solvent accessibility ( fsa) of the native side chain at the same site 
computed from the corresponding crystal structure (Isas et al., 2002). 
The sequence dependence of the solvent accessibility, measured by ei- 
ther fsa or TI (O9), is a “fingerprint” for a protein fold. For example, sol- 
vent accessibility is periodic through regular secondary structure, and 
the period and phase of the function identify the type of secondary 
structure and its orientation in the protein, respectively. In nonregu- 
lar secondary structure encountered in loops, the solvent accessibility is 
not necessarily periodic, but the functional dependence on sequence re- 
mains characteristic of the fold. Thus, comparison of fsa computed from 
a crystal structure and II(Os) determined experimentally for the protein 
in solution is a convenient and efficient way of comparing the solution 
and crystal structures. This will be the method used below for rhodopsin. 

Unlike the case for oxygen, NiEDDA has limited solubility in a bi- 
layer interior and can be used together with II(Os) to distinguish 
RI residues facing lipid from those facing water. Such discrimination 
provides a convenient means of locating sequences that cross mem- 
brane/aqeuous interface. For example, the contrast parameter, defined 
as ® = In [TI(O2g)/T(NiEDDA) ], changes sharply for an R1 residue as 
it is moved along the outside surface of a helix in the region where it 
crosses the interface between lipid and water. For a relatively diffuse 
interfacial region, the simple ratio of accessibility parameters (e®) is 
employed. In addition, ® isa function of depth within the bilayer and can 
be used to estimate the immersion depth of RI in a bilayer (Altenbach 
et al. 1994). 

Two RI side chains in a protein, or an Rl side chain and a paramag- 
netic metal ion in an engineered site, interact by a distance-dependent 
magnetic dipolar interaction. For the usual case where there is a distri- 
bution of interspin distances, the dipolar interaction leads to an EPR 
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spectral broadening that can be analyzed in terms of the average dis- 
tance and the distribution. The measurements can be made under 
physiological conditions (Altenbach et al., 2001a; Voss et al., 1995), 
and the effective distance range up to about 20 A is ideal for map- 
ping helix proximity relationships in membrane proteins. Recently de- 
veloped time-domain EPR techniques extend this range beyond 50 A, 
but the measurements must be carried out at low temperatures (Eaton 
et al., 2001). 

Rotational motion of a nitroxide modulates the anisotropic electron- 
nuclear magnetic dipolar interaction, giving rise to electron relaxation 
that affects the EPR spectral line shape. At X-band frequency, the spectra 
are sensitive to motions with correlation times in the range of 107! < 
Te < 1077, but also reflect the anisotropy of the motion. Figure 5B shows 
simulated EPR spectra for isotropic motion in the fast, intermediate, 
and slow motional regimes and illustrates the high sensitivity of the line 
shape to motional rate. 

The motion of the RI nitroxide in a protein has contributions from the 
overall tumbling of the protein, the internal motions of the side chain, 
and fluctuations in the backbone structure. For membrane proteins such 
as rhodopsin, the correlation time for molecular tumbling is slow on the 
EPR time scale defined above and can be ignored. The internal motion 
of the R1 side chain is due to torsional oscillations about the bonds that 
connect the nitroxide to the backbone, and the correlation times for 
these motions lie in the nanosecond regime where the EPR spectra are 
highly sensitive to changes in rate. 

The motion of Rl in helices has been studied in detail, and this is 
the case relevant for analysis of rhodopsin. At helix surface sites where 
RI has no interactions with other side chains or main-chain atoms, the 
motion is anisotropic and can be accurately modeled by a single or- 
der parameter (S) and effective correlation time (te) (Columbus et al., 
2001). This simple anisotropic motion is expected to be the same at 
all helical surface sites unless modulated by direct interactions of Rl 
with other groups in the protein and/or by local backbone fluctua- 
tions. Interactions of RI with the environment and local backbone 
fluctuations are qualitatively distinguishable by their opposite effects 
on motion: the former reduces and the latter increases the mobility 
relative to a noninteracting reference on the surface of a rigid helical 
segment. 

The motion of a nitroxide side chain in terms of correlation times and 
order parameters can be deduced from spectral simulations (Budil ei al., 
1996). On the other hand, a simple semiempirical quantity, the “scaled 
mobility” (M,), suffices for many purposes (Hubbell et al., 2000). The 
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scaled mobility is a function of both rate and order and is defined as 


where 6 is the central resonance linewidth indicated in Figure 5B, and 
the subscripts m and i refer to the most mobile and immobile residues 
encountered in proteins. The M, value for Rlinarandom coilis M, sl, 
at a noninteracting helix surface site on a rigid helical backbone is 
AM. ~ 0.39, and for RI at a buried site on a rigid helical segment is 
M, © 0. Thus, in an Rl scan along a helical sequence, Ms is periodic 
in position because of periodic modulation of the tertiary interaction 
strength. 

Contributions from backbone fluctuations will increase M, values for 
both surface or buried sites, relative to a rigid reference helix. The back- 
bone contribution is thus reflected in the average value of M, along one 
turn of the helix, the mean scaled residue mobility ((M,)). Alternatively, 
if noninteracting helix surface sites can be identified, deviations of M, 
from the rigid helix reference value may be taken as a direct measure of 
backbone contributions. Noninteracting helix surface sites are readily 
recognized by their diagnostic EPR spectral line shape (Mchaourab etal., 
1996; Columbus et al., 2001). In the following sections, M, and (Ms) will 
be employed to map backbone dynamics for rhodopsin in solution. 


2. Sulfhydryl Reactivity and Disulfide Cross-Linking 


The solvent accessibility of particular sites may be assessed by the 
reaction rate of substituted cysteines with the sulfhydryl reagent 4,4’- 
dithiopyridine (4-PDS) (Grassetti and Murray, 1967; Chen and Hubbell, 
1978) (Fig. 6A). The products of the reaction are a mixed disulfide 
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Fic. 6. (A) Reaction of 4,4’-dithiopyridine with cysteine to produce a mixed disulfide 
and 4-thiopyridone; (B) the subsequent reduction of the mixed disulfide with dithiothre- 
itol (DTT). Rate constants kpps and kprr characterize these second-order reactions. 
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between cysteine and 4-thiopyridine, and 4-thiopyridone (4-TP). The 
reaction rate may be conveniently followed by appearance of the 4-TP 
absorbance at 324 nm, and the rate constant for the reaction, kpps, taken 
as a measure of relative accessibility. The mixed disulfide may be subse- 
quently cleaved with dithiothreitol (DTT) to yield the original cysteine 
residue and 4-TP as shown in Fig. 6B. The rate of this reaction may again 
be monitored by the appearance of 4-TP absorbance, and the rate con- 
stant, kprr, provides a second measure of solvent accessibility. Both ksu 
and kprr have been used as measures of solvent accessibility in rhodopsin 
to complement those of II (O9) and TI(NiEDDA). 

The k and II values give different views of residue “accessibility” in 
a protein. II values, because of the nature of the measurement, reflect 
a state in proportion to its equilibrium population. Thus, rare fluctua- 
tions in the protein structure that transiently expose a buried residue will 
not be seen. On the other hand, the covalent reaction of 4+PDS with a 
buried cysteine during a fluctuation will trap the state, and the event will 
be counted and added to previous such events. Thus, the sulfhydryl reac- 
tivity method is an “integrating” method that can reveal low-frequency 
fluctuations much like hydrogen exchange. A comparison of relative 
II and k values thus may provide information on the existence of low- 
frequency structural fluctuations (Altenbach et al., 1999a,b), and this 
point will be discussed further below. 

In SDSL, proximity relationships are inferred from direct distance 
measurements between R1 residues. A complementary proximity mea- 
surement using the same double cysteine mutants as for SDSL is the 
spontaneous rate of disulfide formation, measured by the rate constant 
kss (Falke and Koshland, 1987). The spontaneous rate of oxidation de- 
pends on many features of the structure, including (1) the distance and 
relative orientation of the residues, (2) the flexibility of the sequences 
containing the cysteines, and (3) the effective p K, of the residues as de- 
termined by local electrostatic potentials. A comparison of distances 
from SDSL with kss can provide information on the structure as well as 
dynamics. For example, relatively large values of both kss and the cor- 
responding interspin distance suggest that the cysteines in question are 
not in proximity on the average, but that one or both are in flexible se- 
quences and come into proximity during rare fluctuations in structure. 


B. Structure of the C-Terminal Tail in Solution 


As noted above, the C-terminal tail is poorly determined in the crys- 
tal structure. In order to examine the structure in solution, cysteine 
residues were substituted, one at a time, at sites in the C-terminal tail 
indicated in Fig. 7A (325, 326, 328, 331, 332, 335-340) (Cai et al., 1997). 
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A C-Terminal Domain 











Fıc. 7. (A) Structural model of rhodopsin showing the location of cysteine substitution 
mutants in the C-terminal tail (325, 326, 328, 331, 332, 335-340). Mutants at 331 and 332 
are not represented because these sites were not modeled in the crystal structure. Also 
shown are sites in C3 (dark spheres at 242, 245, 246, 249) and 65 in C1 discussed in the 
text. Dotted lines indicate sequences not modeled in the crystal structure. (B) Top row: 
EPR spectra of 326R1 and 338R1, representing sites in the proximal and distal portions of 
the C-terminal tail, respectively; center and bottom rows: the two components of a and £, 
respectively, resolved by spectral subtraction. In 338R1, the immobilized component £ is 
minor and is nearly invisible in the experimental spectrum. 


The cysteine residues were derivatized to introduce the R1 side chain 
(Fig. 5A), and the EPR spectra analyzed in terms of R1 mobility (Langen 
et al., 1999). The EPR spectra each revealed two components reflecting 
RI populations of different mobility. Using simulation and subtraction 
techniques, the two spectral components (œ and ß) were resolved and 
analyzed separately (see Fig. 7B for examples). In each case, the most 
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mobile component (a) has effectively isotropic motion with correla- 
tion time in the range 0.3-0.5 ns, similar to RI in a unfolded protein. 
The more immobilized component (ß) varies in mobility from site to 
site, reflecting an ordered secondary and tertiary structure. Thus, the 
C-terminal tail is in equilibrium between an ordered and disordered 
conformation. The equilibrium constant, however, is strongly position 
dependent. For 325R1, 326R1, and 328R1, the ordered state strongly 
dominates (~5% disordered), and it may be concluded that this se- 
quence has a stable configuration. 

In the crystal structure, the 325-328 sequence folds back and packs 
against H8 in a conformation where the side chains of 325R1, 326R1, and 
328R1 are solvent exposed (Fig. 7A). The RI spectra are consistent with 
this assignment. Additional support for an interaction of the 325-328 
sequence with H8 will be discussed below. 

In the sequence beyond 331, there is a dramatic increase in the popula- 
tion of the highly mobile, disordered component. For 338, for example, 
the disordered state accounts for ~70% of the population (Fig. 7B), in- 
creasing to essentially 100% at 340. In the crystal structure, the sequence 
from 331 to 340 is modeled to curl back over the top of TM2 in contact 
with the cytoplasmic surface of the helix bundle. Data from disulfide 
cross-linking and R1 magnetic dipolar interactions in double mutants 
argue against this conformation in solution (Cai et al., 1997). One such 
mutant investigated was 338C/65C. The crystal structure model would 
place the sulfur atoms of the two cysteines in at an optimal distance and 
orientation for disulfide formation (see Fig. 7A), but in fact essentially 
no cross-linking was observed. Given the dynamic properties of this do- 
main observed in solution, it is extremely unlikely that these residues are 
in proximity. On the other hand, when one cysteine was placed at 338, 
and the second scanned through 240-250 in C3, local maxima in cross- 
linking rates were found between 338C and residues 242C, 245C, and 
248C. In either the crystal structure or the proposed helical extension 
of TM6 based on SDSL (see Section III,E below), these residues lie 
on the exterior surface of the molecule, facing the aqueous solution 
(Fig. 7A). In addition, some of the double cysteine mutants in the latter 
experiment were derivatized with spin labels, and it was found that the 
strongest spin-spin interaction occurred between 338C and 245C and 
246C. These residues lie on the same face of TM6 defined by the residues 
marking local maxima in cross-linking with 338C. Thus, it is most likely 
that in solution a dynamic C-terminal sequence from 331 on extends to- 
ward TM6 rather than curling around over the top of TM2. Interestingly, 
removal of the palmitoylation sites has little effect on the structure or 
dynamics of the C-terminal tail, including the structure from 325 to 328 
that lies close to these sites (Langen et al., 1999). 
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C. Structure of Cl and Adjacent Sequences in TMI and TM2 


C1 (65-71) is the shortest of the cytoplasmic TM links and does not 
contain residues essential for transducin activation, although mutations 
K67C and L68C do reduce transducin activation to ~30% of that for 
the wild-type protein (Klein-Seetharaman et al., 1999). Figure 8A iden- 
tifies the sequence through which single cysteine substitution mutants 
were prepared, relative to the crystal structure (Klein-Seetharaman et al., 
1999). Each of these mutants was derivatized to introduce R1, and the 
EPR spectra analyzed in terms of I (O2), II(NiEDDA), and R1 mobility 
(Altenbach et al., 1999b). 

As discussed in Section III,A,1, comparing the sequence dependence 
of fsa and TI (Os) is an efficient means of comparing a crystal structure 
with features of the solution structure derived from SDSL data. Figure 8B 
shows fsa for both the A and B molecules of the crystal (upper panel) 
and II(Os) (center panel) from 60 to 75, which includes parts of TM1 
and TM2 as well as Cl. The A and B molecules have similar structures 
in Cl (Fig. 4B), and this is reflected in the similarity of fsa throughout 
the sequence, except for small differences around 63 and 64 due to the 
absence of the C-terminal domain in the B chain. 

The pattern of experimental solvent accessibilities in solution mea- 
sured by II(Os) clearly mirrors that of fsa, indicating the similarity of 
the crystal and solution structures for the C1 linker. The correspondence 
between solvent accessibility and the crystal structure is illustrated in 
Fig. 8A by the Cy—Cg bonds, which are shaded according to accessibility 
(see figure legend). As expected, residues that project into regions of 
extensive tertiary contact lie at local minima in TI (Os), whereas residues 
that are fully solvent exposed lie at local maxima. The periodicity of 
the TM1 (60-64) and TM2 (72-75) sequences is extended through 
Cl, as observed in other helical hairpin motifs with short loops (Isas 
et al. 2002). The accessibility to collision with the highly polar reagent 
Ni(EDDA) clearly shows that loop residues 64-66 and 70 are solvent 
exposed (Fig. 8B, center panel). Residues 68 and 69 in the loop point 
into the fold of the protein and are hence sequestered from solvent to 
a large extent, as readily identified by both II(Os) and TI(NiEDDA). 

Both ksy and kprr have been employed to investigate the structure 
of Cl, but quantitative data at more sites were obtained with kprr 
(Klein-Seetharaman et al., 1999). Figure 8B (lower panel) shows 
values of kprr for sites in Cl and adjacent helices. Clearly, kprr and 
II(NiEDDA) identify the same sites as being highly solvent exposed, 
although the relative values differ. 

The crystal and solution structures in the vicinity of Cl may be 
compared in another way, namely the determination of proximity 
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Fic. 8. (A) Structural model of rhodopsin showing the location of cysteine substitution 
mutants in Cl and adjacent sequences in TM1 and TM2 (60-75). Individual residues 
are represented by the Cy—Cg bonds in order to indicate that direction in which the 
side chains project. To illustrate the correspondence between experimental values of 
solvent accessibility and structure, the bonds are coded according to local maxima (white), 
minima (black), or intermediate (gray) values of TI (O92). (B) fsa values computed from 
the crystal structure for both A (dark trace) and B (light trace) molecules (upper panel), 
T1(Og) (dark trace) and II(NiEDDA) (light trace) for Rl at each site (center panel), and 
kprr for each site (lower panel). The shaded vertical bars mark residues in TMl and TM2, 
as indicated. 
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Fic. 9. Proximity relationships for residues in C1 relative to 316 in H8. Example of 
RI distance mapping in rhodopsin. For each distance measurement, only two RI side 
chains were in the protein, one fixed at the reference site 316, and the other at a site in 
the sequence 55-75. The RI side chains were modeled based on crystal structure data 
with energy minimization subject to the experimentally determined distance constraint 
(shown). In each case, the measured distances in solution were in good agreement with 
those expected from the rhodopsin crystal structure. Substituted cysteine residues 65 and 
68 most rapidly formed disulfide cross-links with the reference cysteine at 316 in H8. This 
is indicated by the dark bars connecting the potential disulfide partners. 


relationships based on direct distance measurements between pairs of R1 
residues, or on determination of kgs, the rate constant for spontaneous 
disulfide formation. Figure 9 shows an example of results obtained from 
direct distance measurements with SDSL (Altenbach et al., 2001c). In 
this experiment, a reference nitroxide was placed at residue 316 in H8, 
and a second nitroxide scanned through the sequence 60-75. The dis- 
tances determined between each pair in solution were consistent with 
the topology of the crystal structure and known conformations of the 
RI side chain (Langen et al., 2000). Using the same pairs of cysteines, 
proximity was estimated by the rate of disulfide formation, and this 
was again consistent with the crystal structure (Klein-Seetharaman et al., 
2001) (Fig. 9). 

Collectively, results from sequence-correlated solvent accessibility, di- 
rect distance measurement, and disulfide cross-linking rates all suggest 
that the crystal and solution structures are similar in the C1 region. 
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D. Structure of C2 and Adjacent Sequences in TM3 and TM4 


C2 constitutes part of the transducin interaction domain (Fig. 1B). 
Figure 10A shows the sequence in C2, TM3, and TM4 (136-155) that 
has been investigated by cysteine scanning mutagenesis and SDSL (Ridge 
et al., 1995; Farahbakhsh et al., 1995). Figure 10B (upper panel) shows 
the fractional solvent accessibilities for residues in the 136-155 sequence 
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Fic. 10. (A) Structural model of rhodopsin showing the location of cysteine substitu- 
tion mutants (136-155) in C2 and adjacent sequences in TM3 and TM4. (B) fsa values 
computed from the crystal structure for both A and B molecules (upper panel), and 87! 
(the inverse central linewidth) (bottom panel) for R1 at each site. The shaded vertical 
bars mark residues in TM3 and TM4. 
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for both the A and B chain. In the B chain, a structure was not assigned 
for 140-149, presumably because of disorder. In the A chain, the C2 
structure is complete, but different from that in the B chain in seg- 
ments where they can be compared. These properties suggest plasticity 
in C2. 

Experimental solvent accessibilities [II(Os)] are not available for C2, 
so a direct comparison between the crystal and solution structures cannot 
be made on this basis. However, R1 side-chain accessibility and mobil- 
ity are closely correlated (Isas et al., 2002), and Fig. 10B (lower panel) 
shows a plot of R1 mobility, measured by the inverse linewidth, AT. for 
comparison with fsa. In the TM4 helix from 153 to 155, the patterns of 
fsa and 6-1 are similar and reflect the helical structure observed in the 
crystal. However, from 144 to 150, ö-!hasa period of 2, characteristic 
of a £-like structure that only partially agrees with the fsa pattern, and 
this section appears to have a different structure in solution than it does 
in the crystal. Interestingly, this section overlaps with that of unassigned 
structure in the B molecule. Along the sequence 139-143, a gradient of 
increasing sa is also reflected in ö”!. A regular helical structure was in- 
ferred for residues 130-142 in solution from patterns of disulfide bond 
formation between engineered cysteines (Yu and Oprian, 1999). 


E. Structure of C3 and Adjacent Sequences in TM5 and TM6 


The cytoplasmic interhelical linker C3 is a key recognition/binding 
site for transducin (Fig. 1B) as well as for the rhodopsin kinase and ar- 
restin. In the crystal, the structure of this important segment is poorly 
determined because of disorder in the lattice (Fig. 4), presumably 
corresponding to flexibility in the structure. Figure 11A shows rib- 
bon models for the C3 region based on the crystal structure (right 
panel) and a tentative solution structure based on cysteine scanning 
mutagenesis and SDSL data (left panel) (Yang et al., 1996b; Altenbach 
et al., 1996). Figure 11B shows data for fsa (upper panel) and II values 
(lower panel) for the sequence 225-256 which encompasses parts of 
TM5, TM6, and the whole of C3 (231-251). In the B molecule of the 
crystal, a long stretch of amino acids, residues 227-243, could not be 
modeled. The A molecule is more highly ordered, but is still missing 
236-240. 

In the sequences corresponding to TM5 (225-230) and TM6 (252- 
256), fsa for the A molecule reflects the expected helical periodicity, as 
does that for the residues of the B chain that are modeled. For residues 
in these regions that can be compared, the ba and Dia) data are 
in good agreement, suggesting a similar structure in the crystal and in 
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Fic. 11. (A) Structural models of rhodopsin showing the location of cysteine substitu- 
tion mutants in C3 and adjacent sequences in TM5 and TM6 (residues 225-256). The left 
panel is based on the crystal structure (1HZX, A chain), and the right panel on SDSL data 
for the solution structure. In each panel, both top (upper) and side (lower) views of the 
structure are shown. Individual residues in the sequence are represented by the Cy—Cg 
bonds in order to indicate the direction in which the side chains project. The bonds are 
coded according to local maxima (white), minima (black), or in-between (gray) values 
of TI (O2). (B) fsa values computed from the crystal structure for both A and B molecules 
(upper panel), and TI (O9) and II (NiEDDA) for RI at each site (lower panel). The shaded 
vertical bars mark residues in TM5 and TM6, as indicated. 
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solution [the TI (O2) points for 229 and 254 are missing because of lack 
of reactivity of the corresponding cysteine residues with the spin label]. 
In addition, the pattern of cross-linking observed in cysteine and disul- 
fide scans is consistent with helical secondary structure from 218 to 225 
(Yu and Oprian, 1999). Direct distance measurements between pairs 
of Rl side chains, one in TM3 and a second in the sequence 247-252 
in TM6, are also compatible with a regular helical structure for TM6 
near the cytoplasmic surface (Farrens et al., 1996). These latter mea- 
surements will be discussed in Section V,A in relation to photoinduced 
conformational changes. 

In the solvent-accessible C3 sequence (231-251), the crystal structure 
is incomplete and different between the A and B molecules (Fig. 4B). 
On the other hand, the II data in solution suggest a simple helix-turn- 
helix motif for C3, formed from extensions of the TM helices, as shown 
in Fig. 11A (right panel). The helical periodicity of both TI (O) and 
TI(NiEDDA) from 231 to 239 (TM5 extension) and from 241 to 250 
(TM 6 extension) is clearly seen in the data (Fig. 11B, lower panel). The 
interruption of the regular periodicity (a phase shift) between these two 
sequences at 240 is due to a short turn, probably involving three residues. 
The C,—Cg bonds that mark the location and direction of spin-labeled 
side chains in Fig. 11A are shaded according to T (Oz) values (see figure 
legend). The excellent correspondence between I (Og) values and the 
proposed helix-turn-helix motif is obvious (right panel, top view). That 
is, residues that project into the protein interior are at local minima 
in II(Os) and residues fully exposed to solvent are at local maxima in 
TI (O2). On the other hand, the crystal structure does not correspond as 
well to the experimental solvent accessibilities in the C3 sequence 231- 
235 (for example, 233 is fully exposed in the structure, but Il(Os) for 
this site is near a local minimum), and 236-240 are missing and cannot 
be compared. In the crystal structure model, TM5 terminates within the 
bilayer interior. 

The structure assignment in C3 based on SDSL data must be consid- 
ered tentative, because it is based on periodicity alone. In addition, it is 
not possible to predict the topography of the motif. That is, the putative 
helical hairpin could project straight upward as regular extensions of 
the TMs, as shown in Fig. 11A (right panel), or the helical extensions 
could bend toward the membrane. However, the fact that the oscillatory 
TI (O2) and II(NiEDDA) functions are in phase demonstrates that the C3 
structure is homogeneously solvated by an aqueous environment, and 
none of the residues enter the membrane phase (Hubbell et al., 1998; 
Oh et al., 1996). Future distance mapping experiments will allow a more 
definitive structure and topographical description of C3. 
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F. Structure of H8 and Adjacent Sequence in TM7 


Figure 12A shows the structural model of rhodopsin marking the 
sequence 306-321 through which cysteine scanning mutagenesis was 
carried out for investigation of sulfhydryl reactivity (Cai et al., 1999a) 
and SDSL analysis (Altenbach et al., 1999a). This sequence spans the 
C-terminal end of TM7 (306-309) and helix H8 (312-321). The H8 
helix is of particular interest because it is apparently involved in direct 
interaction with transducin (Fig. 1B) as well as with the lipid bilayer. 

The structure of the H8 helix and the adjacent sequence in 306-311 is 
essentially identical in the A and B chains of the crystal lattice (Fig. 4B). 
The differences in the computed values of fsa between the A and B 
molecules along this sequence (Fig. 12B, upper panel) are simply due 
to the absence of the C-terminal domain in the B chain model, which in 
the A chain folds back along H8 and partially occludes side chains from 
about N311 to T320. 

Data for experimental solvent accessibilities [TI(Os)] for R1 side 
chains at 308-316 are reproduced in Fig. 12B (center panel), where 
they can be compared with calculated fsa values. As can be seen, the 
oscillatory behavior of fsa from 308 to 315 in the A chain is reproduced 
by II(Og) in both period and phase. The difference between relative 
fsa and TI values at 316 could be due to a real structure difference, or 
to differences in the molecular size of the native (cysteine) and R1 side 
chains (Isas et al., 2002). 

Evidence was presented in Section III,B that in solution the sequence 
325-328 in the C-terminal tail is ordered and compatible with the crystal 
structure in that region, but beyond 328 the tail is largely disordered in 
solution. This model is consistent with anumber of observations regard- 
ing the H8 sequence. For example, the extremely low chemical reactivity 
of a cysteine residue substituted at position 318, and the modest reac- 
tivity of a cysteine at 319, both on the solvent-exposed surface of H8, 
can be accounted for by the interaction with 325-328 (Fig. 12A). On the 
other hand, site 315, also on the solvent-exposed surface of H8, shows 
high reactivity as well as high II values. Interestingly, residue 315 lies 
just under residue 328 of the C-terminal tail, the position marking the 
end of the ordered state in the solution structure. The EPR spectral line 
shape for 315R1 is consistent with this environment. In the absence 
of the C-terminal domain, 315 should be a helix surface site with a 
single dominant component (Columbus et al., 2001). However, the spec- 
trum of RI (Fig. 13A, black trace) shows two distinct spectral compo- 
nents, one ofwhich corresponds to a strongly immobilized population of 
R1 (oi, and the other to a dynamic mode similar to that expected for 
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Fic. 12. (A) Structural model of rhodopsin showing the location of cysteine substitu- 
tion mutants in TM7 (306-309) and H8 (310-321). Cy—Cg bonds indicate the direction in 
which the side chain projects. The bonds are shaded according to local maxima (white), 
local minima (black), and intermediate (gray) in II(Os). Cysteine residues that were 
unreactive to the spin label reagent and 4-PDS are marked by black spheres at the corre- 
sponding Cy. (B) fsa for the A and B molecules (upper panel), TI (O2) and II (NiEDDA) 
(center panel), and kpps (lower panel) for the sequence 306-321. 
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Fic. 13. (A) EPR spectra for 315R1 in rhodopsin (dark trace) and 72R1 in T4 lysozyme 
(light trace). Component o corresponds to an immobilized population of R1 side chains, 
while £ corresponds to a population with a dynamic mode similar to a solvent-exposed 
helix surface site, such as 72R1 in T4 lysozyme. (B) EPR spectra for 252R1 and 314R1 in 
rhodopsin. 


a noninteracting, helix surface site (£). For comparison, the spectrum 
of a helix surface site in T4 lysozyme is also shown. A likely origin of the 
immobilized state is interaction of 315R1 with the end of the ordered 
sequence in the C-terminal domain. The fact that there are two states 
suggests an equilibrium mixture of either RI rotamers, or possibly two 
conformations of the C-terminal domain. 

The H8 helix is amphipathic, and given its location in the crystal 
structure, it is logical to assume that the hydrophobic surface is solvated 
by the lipid bilayer in the native membrane (or by the micelle in micellar 
solutions). There is direct evidence that this is the case for rhodopsin 
in micelles of dodecyl maltoside. For example, cysteine residues on the 
exposed hydrophobic face (M317, T320, L321) are highly unreactive 
to either 4-PDS (Fig. 12B, bottom panel) or to the spin label reagent 
(Altenbach et al., 1999a). This would be expected for cysteine facing 
the hydrophobic interior of the micelle because of both the suppressed 
ionization of SH groups and the low solubility of the reagents employed 
in a hydrophobic medium. Residue 314 lies on an exposed surface of 
H8 at the boundary between the polar and nonpolar faces. The high 
and low values for TI (Op) and II(NiEDDA) clearly place 314R1 in the 
hydrophobic micelle interior, accounting for the low value of kpns at this 
site (Fig. 12B, lower panel). 
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Additional direct evidence for the location of 314 on a lipid-solvated 
helical surface is provided by the spectrum of 314R1, which has a line 
shape characteristic of other lipid-facing helix surface sites in rhodopsin 
such as 252R1 (Fig. 13B). Although, the native side chain at 314 is argi- 
nine, the a-carbon is located on the hydrophobic face of H8, and the 
side chain bends upward so that the positively charged atoms would be 
located in the polar head-group region of the bilayer, while the nonpo- 
lar carbons of the side chain remain lipid-solvated. The uncharged and 
relatively hydrophobic R1 side chain would project directly into the hy- 
drophobic interior ofthe DM micelle, as indicated by all ofthe evidence 
outlined above. 

As for Cl, the global topology of the TM7—H8 sequence was mapped 
in detail by direct distance measurements between pairs of spin la- 
bels (Altenbach et al., 2001c) and by disulfide cross-linking (Klein- 
Seetharaman et al., 2001). For direct distance measurement using 
spin-spin interactions, a reference RI side chain was placed at site 
65 in Cl, and a second RI placed at each site in the sequence 
306-316. Figure 14 shows the pairs and indicates the interspin distances 





Fic. 14. Proximity relationships in H8 and TM7 relative to 65 in TM1. Examples of 
interspin distances measured in solution. In each distance measurement, only two RI 
side chains were in the protein, one on the reference site 65 and the other at a site in the 
sequence 306-319. The RI side chains were modeled based on crystal structure data with 
minimization subject to the experimentally determined distance constraints (indicated 
in Ä). In each case, the measured distance in solution was in good agreement with that 
expected from the rhodopsin crystal structure. Substituted cysteine residues 315, 316, and 
319 most rapidly formed disulfide cross-links with a cysteine at 65 in TM1. The potential 
disulfide bonds are indicated as gray dashed lines. 
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measured. In each case, the distances were in good agreement with 
predictions based on the rhodopsin crystal structure and the struc- 
ture of the RI side chain. With the same double-cysteine mutants 
used for spin labeling, the spontaneous rate of disulfide forma- 
tion (ks) was determined. The cysteine pairs with most rapid cross- 
linking, 65C/316C and 65C/315C, are in fact those expected from the 
rhodopsin structure. Surprisingly, essentially no cross-linking was found 
between 65C and 319C, although the interspin distance in 65R1/319R1 
was short (~9 A) and the same as for 65R1/316R1 where cross- 
linking was rapid. This interesting difference will be discussed further 
below. 

Collectively, all of the data obtained on the solution structure with 
SDSL, sulfhydryl reactivity, and disulfide cross-linking kinetics strongly 
support the conclusion that the structure of the TM7-H8 sequence in- 
vestigated is very similar in the crystal and micellar solution state. In so- 
lution, the H8 helix is sandwiched between the hydrophobic/aqueous 
interface on one side and residues 325-328 of the C-terminal tail on the 
other. 


G. Dynamics of C1-C3, H8, and the C-Terminal Tail 


The preceding sections highlighted the similarities and differences 
between the static crystal structure and the average solution structure 
of rhodopsin in the cytoplasmic domain. This section considers fluctua- 
tions in the structure of the cytoplasmic domain relative to the average. 
This is of interest, because dynamics sequences are often correlated with 
function, and that appears to be the case in rhodopsin. 

Backbone fluctuations with correlation times in the nanosecond 
regime are revealed in variations of the Rl residue scaled mobility, Ms, 
along the sequence (see Section III,A,1 for definition). Figure 15A (see 
color insert) shows a plot of M, versus sequence for C1—C3, H8, and 
adjacent sequences in the TM helices. Figure 15B shows the cytoplasmic 
surface of rhodopsin color-coded according to M, values. In this figure, 
C3 was modeled from the SDSL data, as in Fig. 9. 

There are several important features in Fig. 15. First, the sequence 
dependence of M, through C1, C2, C3, and H8 follows a pattern essen- 
tially identical to that of TI (O2) (compare with Figs. 8, 11, and 12). This 
is particularly obvious in the extended helical periodicity of both M, and 
II (O2) throughout the long C3 segment. This correlation is expected, 
because the most immobile (buried) and mobile (surface) residues are 
also the most inaccessible and accessible, respectively. Second, the high- 
est values of M, are reached in the C terminus, beyond 328, and in the 
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center of C3. As discussed above, the C-terminal sequence is dynami- 
cally disordered beyond 328, similar to an unfolded protein. Residue 
340R1 defines the highest mobility in the molecule and M, = 1. Sur 
prisingly, the central part of C3 has M, values in the range of 0.5-0.9, 
approaching those of the disordered C terminus. Thus, although the 
periodic variation of M,, (Og), and II(NiEDDA) all confirm a regular 
structure with period ~3.6, the helix is a dynamic structure, and clearly 
not tightly packed with the remainder of the structure. For comparative 
purposes, the mean scaled residue mobility through one turn of a C3 
helix segment (232-235) is (M,) = 0.63 compared to (M;) ~ 0.15 for a 
turn through a well-packed helix in annexin XII or T4 lysozyme (Isas 
et al., 2002; Mchaourab et al., 1996). The gradients in M, along the C3 
sequence suggest that the dynamic mode is fluctuation of backbone di- 
hedral angles rather than rigid-body helical motion, because the latter 
would simply add a constant contribution to all positions. 

It is interesting that the two most dynamic sequences identified by 
nitroxide scanning, C3 and the C-terminal domain, are also the most 
disordered in the crystal structure. Moreover, they are two of the most 
important functional domains of the molecule, both being involved in 
protein-protein interactions along the signal transduction pathway. This 
is in keeping with a growing body of data on other proteins that implicate 
dynamic sequences with protein-protein or ligand-protein interactions 
(Crump et al., 1999; Duggan et al., 1999; Bracken et al., 1999). 

Insufficient data are available to compute (M,) through complete he- 
lical turns of each TM segment, but a useful estimate of (M,) can be 
obtained as the pairwise average of buried and surface sites. On this ba- 
sis, TM1, TM2, TM6, TM7, and surface helix H8 are all relatively rigid, 
having (M,) % 0.1. The corresponding segments in TM4 and TM5 are 
nearly twice as large, suggesting greater flexibility. Interestingly, TM4 
and TM35 at the cytoplasmic face of rhodopsin have the largest surface 
area exposed to lipid, and thus have small areas of contact with pro- 
tein, perhaps accounting for the flexibility. Helices TM3, TM5, and TM6 
all have extensions into the aqueous phase of 1 or more turns. These 
extensions form a triad with a common core, and the mobility of R1 
residues facing this core (138, 139, 230, 234, 244, 247; see Fig. 15B) is 
significantly greater than expected for R1 at buried sites in a well-packed 
protein. Presumably, this is due in large part to the backbone motion of 
the TM6 extension in C3. 

For backbone motions to modulate M,, they must have correlation 
times shorter than about 30 ns (for X-band EPR spectroscopy); low- 
frequency modes with microsecond or longer correlation times cannot 
be characterized by M,. Correlation times in the microsecond regime 
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can be investigated by saturation transfer EPR spectroscopy (Thomas 
et al., 1976; Marsh and Horvath, 1989), but the competing rotational mo- 
tion of the protein complicates interpretation. However, low-frequency 
fluctuations that transiently expose an otherwise buried cysteine can be 
detected by a finite reaction rate of the cysteine with a sulfhydryl reagent 
in solution. Likewise, the formation of a disulfide bond between pairs of 
cysteines otherwise too far apart to react in a static structure can serve to 
identify transient states (Careaga and Falke, 1992). Because these reac- 
tions are essentially irreversible, each event is summed over time in the 
formation of product. In this way, even rare states of the protein can be 
detected, and their probability of occurrence quantified by the reaction 
rate constant. 

It is expected that sites buried within the protein should generally have 
low reactivity due to lack of accessibility. Sites 310 and 313 in the TM7- 
H8 loop region, identified by spheres at the a-carbons in Fig. 8B, are 
interesting exceptions. The very low accessibility to collision with either 
Os or NiEDDA (Fig. 12), and the low mobility (Fig. 15A) clearly demon- 
strate that residues 310R1 and 313R1 are buried in the protein interior, 
as predicted by the crystal structure ( fsa < 0.05 for both). Nevertheless, 
kpps is high, equal to that of some solvent-exposed sites. This suggests 
low-frequency fluctuations of the structure in the turn between TM7 and 
H8. This is further supported by a finite cross-linking rate between a cys- 
teine at 65 or 246 and a second cysteine in the turn sequence 310-312 
(Cai et al., 1999a). The distances between these residues (Cy—C,>10 A) 
are greater than the average distance for disulfides in known proteins 
(5-6 A) (Kosen, 1992). On the other hand, the flexibility is apparently 
limited to the vicinity of the turn, because cysteines 65 and 319, which 
are in close proximity as judged by spin-spin interaction between 65R1 
and 319R1, show no spontaneous cross-linking (Cai et al., 1999a). The 
Cl segment appears to be rigid on a low-frequency time scale as well, 
based on both kprr data (Fig. 8) and the paucity of cross-links formed 
between a reference cysteine at 316 and a second cysteine throughout 
C1 (Klein-Seetharaman et al., 2001). 


IV. LOCATION OF THE MEMBRANE—AQUEOUS INTERFACE AND THE 
STRUCTURE OF THE DISK MEMBRANE 


The crystal structure provides no direct information on the location 
of the membrane-aqueous interface relative to the protein structure, 
although its location can be approximately inferred from sequence infor- 
mation (see below). A direct experimental identification of the residues 
located in the interface is possible with SDSL using the e® function 
described in Section III,A,1. Figure 16 shows plots of e? for the R1 side 
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Fic. 16. Plots of e® versus sequence for the solvent-exposed sites in each of the loops 
investigated. The approximate position of the membrane-aqueous interface relative to 
the rhodopsin sequence is located at the transitions between white and grey background. 


chain at multiple positions throughout (only solvent-exposed sites are in- 
cluded in the analysis) (Altenbach et al., 1994). The function e? changes 
sharply when crossing a polarity boundary, and based on this criterion, 
the approximate location of the membrane-aqueous boundary (within 
one or two residues) is indicated in Fig. 16 for each sequence. 

Fig. 17A (see color insert) shows a ribbon model of the rhodopsin 
structure indicating the residues assigned to the interface in each helix 
by a sphere centered on the corresponding a-carbon. Also shown is a 
sphere on the g-carbon of residue 314, which is located in the inter- 
face (see Section ULF). Clearly, these residues define a unique plane 
of intersection of the molecule with the membrane-aqueous interface. 
The shaded band in Fig. 17 represents a phospholipid bilayer with a 
phosphate-phosphate distance of ~40 A, the expected thickness of the 
bilayer in the disk membrane (Saiz and Klein, 2001). The outer interface 
of the bilayer is positioned so that the polar head groups coincide with 
the intersection plane defined by the data in Fig. 16. This procedure 
then fixes the intersection plane of the molecule on the extracellular 
surface as well. 

The position of rhodopsin in the membrane deduced from SDSL data 
is compatible with the topography of surface residues in the molecule. 
Figure 17B shows a space-filling model of the rhodopsin structure with 
residues color coded according to charge, polarity, and identity of ty- 
rosines and tryptophans. It is clear that the demarcation between the 
charged and hydrophobic residues on the cytoplasmic surface defines 


276 HUBBELL ET AL. 


the same membrane-aqueous boundary shown in Fig. 16. With a fixed bi- 
layer thickness of 40 A, the intradiscal membrane-aqueous interface is 
also coincident with a plane that separates the charged, primarily an- 
ionic, from the hydrophobic residues. 

In general, the aromatic residues tryptophan and tyrosine on the ex- 
posed surfaces of membrane proteins have a high propensity for the 
membrane-aqueous interface (Reithmeier, 1995; Yau et al., 1998). Of 
the 10 solvent-accessible ( fsa > 0.15) tryptophan and tyrosine residues 
in rhodopsin, 8 are located within the membrane-aqueous interfaces 
predicted from SDSL and bilayer thickness data. Finally, the arrange- 
ment of rhodopsin and the bilayer shown in Fig. 17 places the palmitoyl 
groups at C322 and C323 (shown in green in Fig. 17B) in an ideal posi- 
tion to be solvated by the bilayer. 

An unusual feature of the disk membrane system of the rod outer seg- 
ment is the very small and uniform spacing between the surfaces of the 
membrane in the intradiscal space (Fig. 1A). This is of interest because 
the surface potential of the intradiscal surface is ~ —40 mV at physio- 
logical ionic strengths (Tsui et al., 1990), and the equilibrium spacing 
between the membrane surfaces is only ~ 20 A (Chabre and Cavaggioni, 
1975), on the order of twice the double layer thickness. What attractive 
forces overcome the significant electrostatic repulsion to determine this 
close spacing? The structure of rhodopsin provides a suggestion. Figure 
18 (see color insert) shows a model of the disk membrane based on the 
crystal structure of rhodopsin. A prominent feature of the intradiscal 
surface of rhodopsin is a finger-like loop projecting toward the opposite 
membrane surface. Prolines 12 and 23 flank the loop, which contains two 
positively charged residues (K16, R21) that point toward the opposing 
membrane surface, and a number of hydrophobic residues (F13, V19, 
V20). This sequence has among the lowest B factors in the molecule, 
and is thus presumed to be a relatively rigid feature. The projection of 
the loop from the membrane-aqueous interface is ~ 20 A, the same as 
the disk membrane spacing. Thus, the spacing of the disk may be deter- 
mined by interaction of the loop with the opposing membrane, as shown 
in Fig. 18A, possibly involving the interaction of the positive charges with 
phosphatidylserine. 

An alternative and more interesting model is the formation of 
rhodopsin dimers across the disk membranes, as shown in Fig. 18B. The 
putative tail-to-tail dimer interface is formed by side-by-side stacking of 
the 61 and £2 strands from each molecule to make a continuous antipar- 
allel 6-sheet. The Pro-16-Pro-23 loops from each monomer interdigitate 
and lay over one surface of the sheet. Intermolecular salt bridges and 
Ca?* binding sites formed from pairs of carboxylate residues, one from 
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each monomer, could aid in stabilizing the structure (not shown). It is 
interesting to note that the mutation of Pro-23, a key residue in forming 
the loop, gives rise to one type of autosomal dominant retinitis pigmen- 
tosa in which disk formation is abnormal (Liu et al., 1997). 


V. PHOTOACTIVATED CONFORMATIONAL CHANGES: THE RHODOPSIN 
ACTIVATION SWITCH 


A recent review has summarized a large body of experimental data, its 
relationship to the crystal structure, and what can be concluded about 
the mechanism of activation (Meng and Bourne, 2001). To date, assign- 
ment of specific structural changes in rhodopsin on photoactivation is 
primarily based on SDSL analysis using the set of spin-labeled mutants 
described above. Structural changes can be detected either by changes 
in mobility of RI residues at tertiary contact sites, or by direct changes in 
distances between pairs of Rl residues, and both have been extensively 
employed in mapping structural changes in rhodopsin. As discussed in 
Section V,A, photoactivation results in the movement of specific he- 
lical segments. The functional relevance of these movements has been 
assessed by interhelical cross-linking experiments, and these are summa- 
rized in Section V,B. Finally, Section V,C explores the structure-function 
relationships evident from a comparison of the crystal structure and the 
conformational changes believed to underlie activation. 


A. Analysis of Structural Changes from RI Mobility and Interhelical Distances 
Measured by Spin-Spin Interactions 


For each of the spin-labeled sites throughout the cytoplasmic domain 
of rhodopsin, EPR spectra were recorded in the dark and after pho- 
toactivation to the metarhodopsin II (MII) state, the state of rhodopsin 
competent for activation of transducin (Hofmann, 2000). Changes in 
the structure of the protein are reflected by changes in the mobility of 
the R1 side chains, which provide a map of the tertiary contact surfaces 
that rearrange during the transition to the activated form. 

The most dramatic changes, illustrated, for example, in Fig. 19, are 
detected in and around TM6, and have been interpreted in terms of a 
rigid-body tilt of TM6 outward from the center of the protein (Altenbach 
etal., 1996; Hubbell et al., 2000). For example, residues 250R1 and 251R1 
on the buried surface of TM6 dramatically increase in mobility, reflect- 
ing a decrease in tertiary contact interaction, while residues 227R1 and 
231R1 on TM5 experience a decrease in mobility as they are engaged by 
the outward movement of TM6. On the other hand, residue 252R1 on 
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Dark 
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Fic. 19. Changes in RI mobility in and around TM6 due to photoactivation. (A) A 
ribbon model of rhodopsin showing the location of 227R1, 250R1, and 252Rl in the dark 
state. (B) After photoactivation, showing the movement of TM6. (C) The corresponding 
EPR spectra in the dark (heavy trace) and after photoactivation (light trace). 


the solvent-exposed surface of TM6 remains unchanged, demonstrating 
that the secondary structure of TM6 remains intact and that the motion 
is a rigid body motion of the helix. 

Figure 20 (see color insert) shows a summary of the mobility changes 
detected by R1 sensors throughout the cytoplasmic surface of rhodopsin 
and their interpretation in terms of helical motions based on arguments 
similar to those given above for TM6 (Resek et al., 1993; Farahbakhsh 
et al., 1993, 1995; Yang et al., 1996b; Altenbach et al., 1999b). No signifi- 
cant changes were observed for residues not shown. The increases in mo- 
bility for Rl at the buried surfaces of TM1, TM2, TM3, TM6, and TM7 in- 
dicate reduced packing in the core of the protein on photoisomerization 
of retinal. In the case of residues that face TM6 (72R1 in TM2 and 136R1, 
139R1 in TM3) the mobility increases could be coupled to the dominant 
outward movement of TM6. However, the concomitant decrease in mo- 
bility of residues on the outer surface of TM2 (70R1, 74R1), and residues 
in C2 that point toward TM2 (147R1, 148R1), suggests a displacement of 
TM2 toward TM4. The subtle mobility increases in 140R1 further suggest 
a rearrangement in structure at the cytoplasmic surface of TM3, near 
the highly conserved sequence 134E/135R/136Y. Finally, decreases in 
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mobility at 313R1 in H8 have been linked to a possible movement of the 
segment of TM7 above Pro-303 (see below). The origin of the decrease 
in mobility of 316R1 is not clear, but could be due to a movement a 
H8 about a fulcrum located near 316 (such a motion of H8 would be 
coupled to the movement of TM7). 

Collectively, the data suggest movement of the TM segments as sug- 
gested in Fig. 20 by arrows. All the data leading to this model were col- 
lected for rhodopsin in micellar solutions of DM. The mobility changes 
for RI residues in TM6 and TM3 have been confirmed for rhodopsin 
reconstituted in lipid bilayers, but the changes at 316R1 are greatly re- 
duced in magnitude (N.-J. Yu, K. Cai, H. G. Khorana, and W. Hubbell, 
unpublished). Thus, the dominant movement of TM6 and smaller move- 
ments in and around TM3 on photoactivation are preserved in the native 
membrane, but the environment may modulate the magnitude of the 
changes at 316, and possibly other sites. 

The mobility changes shown in Fig. 20 clearly identify the type 
(rigid body helix motion) and direction of conformational changes in 
rhodopsin during activation, but cannot as yet provide quantitative mea- 
sures of the magnitude of the helix movements. For this purpose, direct 
distance measurements between multiple pairs of RI residues in heli- 
cal segments were made before and after photoactivation. Helix pairs 
TM3/TM6 (Farrens et al., 1996), H8/TM1 (Altenbach et al., 2001b,c), 
and H8/TM2 and TM1/TM7 (Altenbach et al., 2001c) have been in- 
vestigated in detail. For example, Fig. 21 shows results of experiments 
to map distance changes between residue 139R1 in TM3 and a second 
residue in the sequence 247-251 in TM6. In the dark, the interspin dis- 
tance between 139R1 and each of the other residues was consistent with 
the rhodopsin crystal structure (Fig. 21A). On light activation, there was 
a striking pattern of distance changes reflecting an outward motion of 
TM6 relative to TM3 of ~8 A to account for the decrease in distance be- 
tween 139R1 and 250R1 (Fig. 21B). This represents the largest distance 
change observed and corresponds to the motion of TM6 inferred from 
changes in RI mobility (Fig. 19). Similar studies were carried out with 
the double mutants shown in Fig. 9 and 14 to map relative movements 
among helices TM1, TM2, H8, and TM7 due to photoactivation, and 
the results support the movements summarized in Fig. 20, and provided 
the displacement amplitudes shown in the figure. Additional evidence 
for the movement of the C-terminal segment of TM7 away from the 
core of the protein is provided by the fact that a peptide segement near 
the TM7-H8 turn is only accessible to an antibody after photoactivation 
(Abdulaev and Ridge, 1998). The low-amplitude structural fluctuations 
in the TM7-H8 turn inferred from the reactivity of 310C and 313C also 
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Fic. 21. Changes in distance between R1 residues in TM3 and TM6. (A) Ribbon model 
of rhodopsin in the dark state showing the location of RI side chains. For each distance 
measurement, only two RI side chains were in the protein, one fixed at the reference site 
139 in TM3, and the other at a site in the sequence 248-252. (B) After photoactivation, 
showing the movement of TM6 and the new positions of the pairs. (C) The corresponding 
EPR spectra of the double mutants in the dark (solid traces) and after photoactivation 
(light traces). A decrease in spectral amplitude represents a decrease in interspin distance. 


point to a flexible structure whose role may be to support the proposed 
movement of TM7. 

This activated MII state is transient, and following the spontaneous 
decay of MII to opsin (the apoprotein), the receptor is once again in 
an inactive state, similar to rhodopsin itself. Interestingly, light-induced 
conformational changes detected by spin labels in the native membrane 
also reverse on the time scale of MII decay (Farahbakhsh et al., 1993). 
The opsin molecule produced by the decay of MII is thermodynamically 
less stable than rhodopsin, as expected from the loss of the chromophore 
binding energy. This is reflected in a dramatically increased flexibility 
in the molecule. For example, only in the opsin state are cross-links 
formed between native residues C140 and C316, whose a-carbon atoms 
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are nearly 30 Ä apart (Yu and Oprian, 1999). Moreover, the structure 
of opsin, but not rhodopsin, is strongly coupled to the thickness of the 
lipid bilayer environment (Farahbakhsh et al., 1992). 


B. Interhelical Cross-Linking Experiments and the Functional Relevance of the 
Helical Movements 


Are the helical movements detected by the spin labels functionally 
important? If so, a molecular cross-link between the helix in question 
and an adjacent helix that does not move or moves in a different di- 
rection should block activation of transducin. To implement this strat- 
egy to identify functionally relevant helix motions, two different kinds 
of interhelical cross-links have been employed. In the first, helices are 
cross-linked by spontaneous or catalyzed disulfide formation between 
two cysteines, one engineered in each helix. In the second, cross-links 
are introduced by binding of Zn?* to engineered histidine ligands, one 
in each helix (Elling et al., 1995). 

Twelve cross-links designed to explore the role of helix movement at 
the cytoplasmic surface ofrhodopsin are shown in Fig. 22. An important 
conclusion from these studies is that any interhelical cross-link involving 
TM6 blocks activation. These include the five disulfide cross-links be- 
tween 139C in TM3 and a second cysteine in the sequence 247C-251C 
in TM6 (Farrens et al., 1996; Cai et al., 1999b), the cross-link between 





Fic. 22. Cross-links introduced into rhodopsin to test role of helix movement in ac- 
tivation. Of the 12 cross-links shown, 11 are disulfide bonds produced by oxidation of 
engineered cysteine pairs. The pairs are 139/247 (i), 139/248 (i), 139/249 (i), 139/250(i), 
139/251 (i), 65/316(p), 136/222 (i), 136/225 (p), 140-222 (i), 140/225(p). One cross-link, 
138/251 (i) was produced by Zn?* binding (see text). The letter in parentheses follow- 
ing each pair identifies the cross-link as inhibitory (i) or permissive (p) with respect to 
transducin activation. Inhibitory bonds are shown in black, and permissive in gray. 
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312C in H8 and 246C in TM6 (Cai et al., 2001), and the Zn?* cross-link 
between 138H in TM3 and 251H in TM6 (Sheikh et al., 1996). These 
results strongly support the notion that movement of TM6 away from 
both TM3 and H8, as suggested in Fig. 20, is required for activation. In 
contrast, disulfide cross-links between residues 242 or 245 in TM6 and 
338 in the C-terminal tail have little effect on transducin activation (Cai 
et al., 2001). As discussed above, the formation of these disulfide bonds is 
incompatible with the crystal structure, but is consistent with the highly 
dynamic nature of the C-terminal domain found in solution. Because of 
the high mobility of the C terminus, the cross-link with residue 242 or 245 
on the outer surface of TM6 would not inhibit its movement during acti- 
vation or interaction of transducin with residues on the internal surface. 

Interestingly, disulfide bonds formed between residue 276C in TM6 
and residue 198, 200, or 204 in TM5 on the extracellular surface were 
permissive with respect to transducin activation, indicating a lack of mo- 
tion of TM6 relative to TM5 on this surface (Struthers et al., 1999, 2000). 
As will be discussed below, this important result is consistent with the 
proposal that TM6 moves about a hinge point at Pro-267, so cross-links 
on one side of the hinge have no effect on motion. 

Of five disulfide cross-links formed between residues in the cyto- 
plasmic ends of TM3 and TM5 (136C/222C, 136C/225C, 139C/225C, 
140C/222C, 140C/225C), the two involving 222C were inhibitory and 
the others permissive with respect to transducin activation (Yu and 
Oprian, 1999). The disulfides that would produce the largest distortions 
in the structure on formation are in fact permissive and the smallest dis- 
tortions are inhibitory. Thus, the mixed results cannot be explained in 
terms of protein distortion, and all that can be concluded is that large- 
scale relative movements of TM3 and TM5 are not involved in rhodopsin 
activation. Finally, a cross-link between 316C in H8 and 65C in TM1 is 
permissive, and it is concluded that a relative movement between these 
residues is unimportant for activation of the receptor. However, 316C is 
at the contact point between H8 and TMI and could be at a fulcrum 
about which H8 rocks relative to TM1. 

Other evidence that the helical motions illustrated in Fig. 20 are 
related to activation comes from both kinetic studies and structural 
changes produced by constitutively active mutations. An important fea- 
ture of SDSL is that conformational changes sensed either by side-chain 
mobility or changes in interspin distance can be monitored in real time 
in the millisecond domain. An early SDSL kinetic investigation reported 
that the light-activated structural change in TM3 sensed by R1 mobility 
changes had the same rate constant and activation energy as the appear- 
ance of MII (Farahbakhsh et al., 1993). Moreover, the conformational 
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change reversed with a decay time that matched the decay of MII. Similar 
experiments have not yet been carried out for the important movement 
of TM6. 

The apoprotein opsin activates transducin with an efficiency 10° times 
less than that of MII (Melia et al., 1997). However, a number of muta- 
tions have been found that constitutively activate the opsin to various 
degrees (Rao and Oprian, 1996; Han et al., 1998). The structural origin 
of constitutive activation has been explored with the use of “sensor” R1 
side chains placed either at site 140, 227, 250 or 316. As indicated in 
Fig. 20, RI residues at these sites serve to monitor the status of the he- 
lix bundle and report movement of TM3 (140), TM6 (227, 250) or H8 
(316). These sensor mutants were employed to monitor the response 
of the rhodopsin helix bundle to various mutations that constitutively 
activate the protein. The remarkable conclusion of these studies was that 
constitutively active mutations trigger the movement of specific helical 
segments in the dark state that closely resemble the corresponding move- 
ments due to photoactivation, supporting the functional importance of 
the helix movements (Kim et al., 1997). 


C. Structure-unction Relationships 


As described in the previous sections, cysteine scanning mutagenesis 
and the associated techniques of SDSL, sulfhydryl reactivity, and disulfide 
cross-linking rates have provided a rather detailed view of rhodopsin dy- 
namics in solution and conformational changes leading to the activated 
state. In this section, the structural origins of these functional properties 
in solution are examined from the point of view of the crystal structure. 

As shown in Fig. 4, the crystallographic B factors for C3 are the highest 
in the structure. This disorder in the crystal is clearly mirrored by the 
dynamics in solution, as shown in the scaled mobility map of Fig. 15A. In 
both the crystal structure and the solution structure inferred from SDSL 
(Fig. 11A), residues in C3 make few tertiary contacts, a fact that ac- 
counts for the disorder/dynamics. What is the functional advantage for 
the dynamic flexibility of C3? One answer is that the large-scale move- 
ment of TM6 on photoactivation (Fig. 20) requires that C3 be flexi- 
ble. C3 contains numerous sites that are apparently involved in interac- 
tion /activation of transducin (Fig. 1B). The interaction of the transducin 
o subunit with activated rhodopsin must be specific but not too strong, 
because, following GDP/GTP exchange, the subunit must dissociate 
from the receptor (Fig. 2). A high specificity can be achieved by requir- 
ing multiple attractive interactions, but this would lead to an enthalpic 
contribution favoring a high binding constant. However, the binding 
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free energy, and hence the equilibrium constant, could be limited by 
a large entropic loss that would occur upon transducin binding to the 
flexible backbone of C3. 

In the crystal structure of rhodopsin (1HZX, A chain), the C-terminal 
tail is modeled, but the B factors are high. Arguments against this 
structure in the solution state were given above, and all evidence fa- 
vors a dynamically disordered state in solution from 328 to 348, with 
little change produced by photoactivation (Langen et al., 1999). This 
sequence of the C-terminal tail is of functional importance because it 
contains residues $334, $338, and S343, the major sites of phosphory- 
lation of rhodopsin by rhodopsin kinase during the deactivation phase 
of transduction (Ohguro et al., 1994) (Fig. 1B). The high mobility of 
the C-terminal phosphorylation domain is similar to other substrates 
for protein kinases where the high mobility of the region surrounding 
the site of phosphorylation is likely to be important in recognition and 
affinity (Johnson et al., 1998). 

The extracellular domain of rhodopsin that defines the chromophore 
binding site has the lowest B factors in the structure, is tightly packed, 
and is characterized by numerous stabilizing interactions, many of which 
are between the chromophore, the transmembrane helices, and beta 
strands forming the floor of the binding cavity (Palczewski et al., 2000; 
Teller et al., 2001). On the other hand, the helix bundle at the cytoplas- 
mic domain is stabilized by weak van der Waals and hydrophobic interac- 
tions, the interhelical links are dynamic in solution and disordered in the 
crystal structure, and the domain undergoes structural rearrangements 
upon photoactivation. This general picture is in keeping with the pro- 
posal of Khorana that the extracellular surface serves as a template for 
the folding of the protein, and mutations in this region of the molecule 
often lead to misfolded protein (Hwa et al., 1997; Garriga et al., 1996; 
Liu et al., 1996). 

The most dramatic change resolved by SDSL in the cytoplasmic do- 
main due to photoactivation is the movement of TM6, with relatively 
smaller movements in the rest of the structure. It seems surprising that 
one helix of a bundle can move without destabilizing the entire struc- 
ture. The structure of rhodopsin provides clues to how this may come 
about. First, TM6 makes fewer polar interactions with its neighbors than 
other helices. As discussed previously, there are networks of hydrogen- 
bonding interactions that connect TMI-TM5 and TM7, but TM6 is con- 
nected only to TM7 by only two side chain/backbone hydrogen bonds 
(Teller et al., 2001). Second, TM6 is unique in its poor packing within 
the structure. Figure 23A (see color insert) shows cavities within the 
rhodopsin molecule identified with a probe of 1.4 A (Teller et al., 2001). 
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The largest cavity (145 A%) lies between TM3 and TM6 at the level of 
residue 258 in TM6. In addition, there is a cavity at the interface between 
TM6 and TM5, reflecting loose packing between TM6 and this nearest 
neighbor. This latter point is emphasized in Fig. 23B, which shows a 
view of rhodopsin facing the interface between TM5 and TM6. In this 
figure, the helices behind TM5 and TM6 (TM3, TM4) were removed to 
show the free volume at the interface. Between P267 and the cytoplas- 
mic termination of TM6 the helix packing is less than optimal. However, 
between P267 and the extracellular surface, TM6 is tightly packed, and 
the highly conserved P267 is the likely hinge site for the rigid body tilting 
motion of TM6 observed in SDSL studies reviewed above (see Figs. 3, 
23A, and 23B). Indeed, suitable rotations about the P267 backbone dihe- 
dral angles moves TM6 along a trajectory that accounts for the mobility 
changes at 250R1 and 227R1 (this is how the tilted helix in Fig. 21 was 
generated). This model is supported by the fact that disulfide cross-links 
involving TM6 sites between P267 and the cytoplasmic surface are in- 
hibitory with respect to transducin activation, whereas those between 
P267 and the extracellular surface are permissive (Struthers et al., 1999, 
2000; Yu et al., 1995). Other smaller helix motions observed in TMI- 
TM3 and TM7 could simply be the response of the helix bundle to the 
movement of TM6, or they may be associated with more fundamental 
events responsible for triggering the movement of TM6 itself. 


VI. SUMMARY: THE MECHANISM OF RHODOPSIN ACTIVATION 
AND FUTURE DIRECTIONS 


The focus of this review has been on rhodopsin structure and dynamics 
at the cytoplasmic face in solution, the comparison between the solution 
structure and the crystal structure, and the structural changes underly- 
ing receptor activation. The data from SDSL and disulfide cross-linking 
together indicate that the crystal and solution structures are very sim- 
ilar at the level of the backbone fold for C1, H8, and the cytoplasmic 
termination of TM1-IM7. However, substantial differences are seen in 
C3 and the C-terminal tail, wherein the backbones are flexible on the 
nanosecond time scale in solution. Not surprisingly, these are the most 
disordered regions in the crystal lattice and correspond to sequences 
important for function. 

Two independent strategies, modulation of tertiary contact interac- 
tion and direct distance mapping, provide compelling evidence for a 
dominant rigid-body movement of TM6 near the cytoplasmic surface, 
with smaller but significant movements of TM1, TM2, TM3, and TM7. 
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The large-amplitude motion of TM6 is enabled by the highly flexible C3 
link that connects TM5 and TM6 at the cytoplasmic surface. There is 
now abundant evidence for a similar movement of TM6 in other GPCRs 
(Gether, 2000), and it may be a “central dogma” of GPCR receptor acti- 
vation as proposed by Khorana (2000). However, it is possible that addi- 
tional induced structural changes may occur on binding of transducin. 

The pattern of helix motions observed experimentally suggests a sim- 
ple model for the activated state in which helices on opposite sides of 
the chromophore binding pocket move outward to open a cleft in the 
molecule (Fig. 20). In the prose of Henry Bourne, the molecule has 
“blossomed” (Meng and Bourne, 2001) to expose new surfaces for cou- 
pling of rhodopsin with transducin. 

How are these helix movements coupled to chromophore isomeriza- 
tion? A detailed mechanistic answer is unknown, but a general view is 
that geometric isomerization results in unavoidable repulsive (steric) 
interactions with the protein, the relaxation of which removes a set of 
constraints on TM6 (and others), allowing for the thermal motions nec- 
essary to form the active state (for review, see Hofmann, 2000; Arnis and 
Hofmann, 1993). The challenge ahead is to identify the specific steric 
interactions between groups on the chromophore and protein, and how 
they guide the protein along the activation pathway to remove the neces- 
sary constraints, which also remain to be identified. It is already clear that 
the ionone ring and the 9-methyl group of the chromophore participate 
in critical steric interactions with the protein (Jager et al., 1994; Borhan 
et al., 2000; Meyer et al., 2000; Vogel et al., 2000), and that the salt bridge 
between the protonated Schiff base of the retinal at K296 (in TM7) 
and the counterion E113 (in TM3) is a critical constraint that must be 
removed to form the activated state of the protein (Rao and Oprian, 
1996). Judging from the results reviewed above, systematic application 
of time-resolved SDSL to monitor helix movements, together with the 
use of chemically modified chromophore structures, is a promising ap- 
proach to exploring the relationship between chromophore structure, 
the critical salt bridge, and helix movements leading to activation. 
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I. AN ANCIENT AND LONG RECOGNIZED CHANNEL 


Acentury ago Alfred Fischer lysed pathogenic bacteria by placing them 
in hyperosmotic solutions, and found that they did not undergo lysis 
when placed in hyperosmotic glycerol solutions (Fischer, 1903). They 
did not shrink as expected from the osmotic imbalance. He concluded 
correctly that they had to be very permeable to glycerol. The “glycerol 
facilitator” was identified in a brilliant series of studies pioneered by 
E. C. C. Lin and colleagues that began in the mid-1960s (Sanno et al., 
1968) and led to the conducting specificity of GlpF (Heller et al., 1980), 
to cloning and sequencing of the gene (Sweet et al., 1990), and to charac- 
terization of the 281 amino acid 29,780 Da GlpF protein (Weissenborn 
et al., 1992). The seemingly incongruous genetic relatedness between 
GlpF, a soybean nodulin-26, and the major intrinsic protein (MIP) of the 
eye lens was identified (Baker and Saier, 1990) before the first functional 
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characterization of a water channel. Today they are all in the aquaporin 
channel family. 

The “glycerol facilitator” GIpF, a highly selective transmembrane chan- 
nel that conducts glycerol and certain other small uncharged organic 
molecules, was the first member of the large aquaporin family (AQP 
family) to be identified at the genetic level and functionally character- 
ized (Eze and McElhaney, 1978). Once glycerol is passaged inside the 
cell, it is rapidly phosphorylated by glycerol kinase to produce glycerol 
3-phosphate that, in turn, proceeds by dehydrogenation to dihydroxy- 
acetone phosphate (DHAP), or to phospholipid synthesis where glycerol 
provides the basis for attachment of fatty acid chains and phophatidyl 
head groups in ~2/3 of cellular phospholipids. The glp regulon is in- 
ducible by glycerol 3-phosphate which provides an advantage for growth 
of Escherichia coli on glycerol. The glpF gene is the first gene on the glp 
operon that also contains the gene for glycerol kinase. 

The lack of an energy-dependent process for glycerol accumulation 
was inferred by the lack of any ability to concentrate glycerol in kinase- 
negative mutants (Lin, 1976). The rate of GlpF-mediated glycerol influx 
is 100- to 1000-fold greater than expected for a transporter and is non- 
saturable to a glycerol concentration of >200 mM (Heller et al., 1980), 
providing early evidence that it is a highly selective channel. In addition 
there is no counterflow of radioactive substrate against a concentration 
gradient, again indicating that GlpF acts as a selective channel (Maurel 
et al., 1993). 

GlpF not only conducts glycerol but also can conduct urea, glycine, 
and DL-glyceraldehyde, and is both stereo- and enantioselective in con- 
ductance of linear carbohydrates called alditols (Heller et al., 1980). 
These results have been extended using highly purified glpF in reconsti- 
tuted vesicles (Fu et al., 2000). The cyclized alditols, then termed aldoses 
or sugars, are not conducted through the GlpF channel, and the struc- 
ture of the GlpF channel shows that it is too small to conduct the cyclic 
molecules and beautifully explains aspects of the stereo- and enantiose- 
lectivity (Fu et al., 2000; Nollert et al., 2001). GlpF also conducts water at 
about 1/6 of the rate that the water-conducting homologue AQPZ does. 

Though the similarity between GlpF, nodulin-26, and MIP had been 
recognized in 1990, it was not until the first water-conducting channel 
had been identified based on function (Preston and Agre, 1991) that the 
larger family had a functional definition as a superfamily of membrane 
channels that conduct water and other small uncharged molecules. The 
first cDNA of a water-conducting channel to be identified as a protein 
channel, a protein abundant in erythrocytes, and previously termed 
CHIP28 (abbreviated from channel-like integral membrane protein, a 
28 kDa protein), was discovered by Agre and colleagues (Preston and 
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Agre, 1991). The family was subsequently given the functional name 
“aquaporins” (Agre et al., 1993). Since then AQPs have been found in 
all species from bacteria to yeast, plants, and human. The sequences of 
now more than 200 AQP aquaporins have been determined. Thus, be- 
cause GlpF is in this family, most of what can be said about the structure 
of GlpF will carry over directly to other AQPs and vice versa and can be 
seen in the context of its selectivity versus other family members. 

The conservation of amino acid sequence between any two AQPs is 
typically in the range 28-32%; thus they are all constructed around a 
highly conserved structural backbone fold. The family arose by tandem 
intragenic duplication (Pao et al., 1991) to give proteins that have an 
internal repeat where the N-terminal segment displays ~20% conserva- 
tion with the C-terminal segment (Wistow et al., 1991). This duplication 
event would seem to have appeared early in evolution since bacteria such 
as Escherichia coli contain both a water plus glycerol conducting channel 
(GlpF) and a separate water channel (AQPZ). Each segment contains a 
conserved -Asn-Pro-Ala- signature sequence (-NPA-) near its center, and 
several other conserved residues including a Glu near the beginning 
of each segment. While conservation between the two repeats suggests 
similarity of roles in a bidirectional conducting channel, the C-terminal 
segment plays a more dominant role in solute specificity and regulation 
(Reizer et al., 1993). 

The aquaporins (AQPs) are a large family of genetically divergent 
channels that conduct their substrates at close to the diffusion-limited 
rate. There are 11 members with key roles in human physiology num- 
bered from AQP-0 to AQP-10. AQPs are composed of functionally dis- 
tinct subgroups that include transmembrane water-conducting channels 
(aquaporins) and channels that, like GlpF, conduct glycerol as perhaps 
the most relevant physiological substrate in humans, but also conduct 
urea, DL-glyceraldehyde, linear polyalcohols, (called alditols), and cer- 
tain other small organic molecules (Preston et al., 1992; Maurel et al., 
1994; Park and Saier, 1996). In humans AQP-3, AQP-7, AQP-9, and 
AQP-10 fall into this subclass (Verkman et al., 1995). The divergence be- 
tween the subfamilies appears to have occurred ~2 billion years ago, and 
representative ancestors of both subclasses are found in £. coli (Heymann 
and Engel, 2000). 

The major intrinsic protein (MIP) of the eye lens, once thought to 
play a primary role as a gap junction between cells of the eye lens, a 
biophysically characterized aqueous channel that formed arrays in the 
eye lens (Gorin et al., 1984), was brought back to the fold as AQP-0. 
Many of the AQPs in eukaryotes are regulated by phosphorylation, pH, 
osmolarity, or binding of other proteins or ligands (Anthony et al., 2000; 
Engel et al., 2000). 
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AQPs are highly insulating to the electrochemical potential across 
the cell membrane by sharp selectivity that absolutely excludes conduc- 
tion of all charged molecules, all ions including hydroxide and hydro- 
nium ions (Finkelstein, 1987), and protons. An important fundamental 
question is how this is accomplished, and structural data on GlpF and 
AQP-1 have given rise to suggested mechanisms for this (Mitsuoka et al., 
1999; Fu et al., 2000). Molecular mechanics has been harnessed to iden- 
tify, and then evaluate the contributions of each factor to this property 
(Tajkhorshid et al., 2002). 

There is only one recognized exception to the absolute insulation to 
ions found in human AQPs, where AQP-6 conducts anions. This adap- 
tation is particularly interesting as a means of defining the basis for this 
selectivity. AQP-6 is activated by low pH, most probably to control its 
action in membranes of intracellular organelles (Yasui et al., 1999). 

GlpF, AQP-0, AQP-1, and (it is expected) all AQPs form tetramers 
of four independent water channels in membranes (Smith and Agre, 
1991). GlpF remains a tetramer throughout purification and crystalliza- 
tion, and the structure of GlpF at 2.2 A resolution shows that it crystal- 
lizes as a tetramer (Fu et al., 2000). In reconstituted membranes these 
tetramers come together alternately cytoplasmic and extracellular side 
up. Electron crystallography of these reconstituted arrays (Cheng et al., 
1997; Li et al., 1997; Walz et al., 1997; Hasler et al., 1998b; Ringler et al., 
1999) at increasing resolutions down to 4.5 A in-plane (4.5 x ~9 A in 
3D) resolution (Heymann and Engel, 2000) reveals a right-hand twisted 
arrangement of six, and two half-spanning a-helices that surround a cen- 
tral channel (Heymann and Engel, 2000). These pioneering structural 
investigations defined the relationship of the protein fold to the lipid 
bilayer and provided the first views of how the helices are folded around 
the central “hourglass”-like channel. 

The structures of AQPs at increasing resolutions from the electron mi- 
croscopic structures reconstituted into phospholipid-based membranes 
for GlpF (Braun et al., 2000; Stahlberg et al., 2000) and culminating in 
atomic structures from X-ray crystallography at a uniform 2.2 Ä for de- 
tergent solubilized GlpF (Fu et al., 2000) and for deglycosylated AQP-1 
(Sui et al., 2001) beautifully illustrate the basis for observations of their 
selectivity and rates of conductance. The atomic structure of GlpF also 
served as a template for molecular mechanics treatments of the theo- 
retical basis for the rates of glycerol conductance in quantitative terms 
(Jensen et al., 2002) and as a basis for assembling models of all other 
AOPs including AQP-1 (de Groot et al., 2001). Though now superseded 
by the crystal structure of AQP-1 (Sui et al., 2001), these models provide 
a quantitative diagnostic as to how good such models of other AQPs are 
likely to be when modeled on GlpF or AQP-1. 
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Throughout simulations based on the high-resolution GlpF structure 
it is clear that the GlpF channel is a remarkably well behaved template 
that remains stable to molecular mechanics using current force fields 
(de Groot and Grubmuller, 2001; Jensen et al., 2002). One reason is 
the degree of correctness of the final refined structure, but another 
is perhaps that a channel that conducts molecules of neutral charge, 
maintained by an appropriate approximation to a lipid bilayer, is in 
fact very stably restrained by the balance of hydrophobic forces in the 
membrane. 


II. THREE-DIMENSIONAL STRUCTURE OF GlpF WITH GLYCEROL IN TRANSIT 


Structures of channels are extremely difficult to obtain for numer- 
ous reasons. However the first structures in each class have been critical 
to understanding mechanisms(Rees et al., 2000). The structure of GlpF 
at 2.2 A resolution (residues 6-259), with its primary permeant substrate 
glycerol, elucidates the mechanism of selective permeability (Fu et al., 
2000). Three intermediate glycerol binding sites define an amphipathic 
transmembrane pathway that conducts glycerol and certain other lin- 
ear carbohydrates. Glycerol bound close to the narrowest portion of 
the channel around conserved Arg-206 shows that glycerol molecules 
move through the channel in single file, with the alkyl backbone wedged 
against a two-sided hydrophobic corner. Two successive OH groups each 
form hydrogen bonds with a pair of acceptor and donor atoms on the 
opposite sides of a four-sided tripathic channel. The stereoselectivity of 
the channel is also revealed by functional assays of substrate conduc- 
tion. This structure of the channel explains the preferential permeabil- 
ity for linear carbohydrates and absolute exclusion of ions and charged 
solutes. 

Surrounding each monomeric, glycerol-conducting channel are six 
and two half-membrane-spanning a-helices arranged in a right-handed 
super-twisted fashion (Fig. 1). The anticipated location of the N terminus 
of GlpF was confirmed to be cytoplasmic (Fu et al., 2000) by establishing 
that accessibility of a thrombin site inserted between the histidine tag 
and the mature N terminus is inaccessible to thrombin until cells are 
permeabilized. This has also been shown to be so in aquaporins (Preston 
et al., 1994). 

The protein is constructed as two segments representing the genetic 
duplication that are related by a quasi twofold axis that would pass 
through the center of the bilayer and almost intersect the fourfold 
axis of the tetramer. The first segment (residues 6-108) forms one side 
of the channel (Fig. 2). It begins with two membrane-spanning helical 
segments, M1 (residues 6-34) and M2 (40-60). These are followed by 
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Fıc. 1. (Left) The monomeric GlpF channel is vertical. GlpF and all AQP channels are 
composed of six and two half helices distinguished in different shades of gray. The view 
is from the lipid bilayer looking in to the quasi-twofold symmetry axis, which intersects at 
90° with the fourfold axis of the tetramer that would be vertical, behind this image. These 
channels all present the NH groups of conserved side chains N68 and N203 into the center 
of the channel. The -NPA- regions are seen elaborated as space-filling representations in 
the center of the figure (medium gray). (Right) The detailed hydrogen bonding around 
the NPA conjugate pair is shown and orients the asparagine NHg groups to precisely 
position one NH from each of N68 and N203 to point directly into the lumen of the 
channel, at about the same height. There they determine the polarization of the central 
water molecule in the channel. Modified from Fu et al., 2000, Science 290, 481-486, with 
permission from the American Association for the Advancement of Science. 


seven amino acids (61-67) that project into the center of the chan- 
nel as an extended polypeptide chain that orients four carbonyls as 
hydrogen bond acceptors to the cytoplasmic vestibule (Fig. 3). This 
chain initiates M3 (68-78), the first of two short a-helices, that be- 
gins with the first -NPA- sequence near the center of the bilayer and 
returns to the cytoplasmic side. M3 packs laterally against M1 rather 
than its neighbor in the linear sequence, M2. This is followed by a 
third transmembrane helix M4 (85-108) that is packed between M3 
and M2. 

The second segment (144-254) reiterates a similar transmembrane 
topology, but beginning from the periplasmic side rather than from 
the cytoplasmic side. Two more spanning a-helices, M5 (144-168) and 
M6 (178-194), are followed by a second extended chain incursion into 
the transbilayer region from the outside. It presents four carbonyls as 
hydrogen bond acceptors to the periplasmic vestibule and initiates the 
second half-spanning helix M7 (203-216) (Fig. 3). Beginning with the 
second -NPA- sequence it returns to the outside. Finally M8 (232-253) 
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Fic. 2. The amino acid sequence of GlpF is arranged topologically as in the structure. 
The helices are viewed as if from inside the channel, viewed looking away from the fourfold 
axis out toward the lipids. The -NPA- motifs are in the center of the figure. Three and 
one half helices form each segment, labeled M1-M4 and M5-M8. Related helices MI, 
M5, M2, M6, etc., are boxed in similar shades of gray to represent their homologous 
roles in the structure. Residues in black circles interact with glycerol. Residues in gray 
circles contribute either carbonyl oxygen, or amides (NHs) to the channel, or contribute 
hydrocarbon to the channel. Light gray circles represent residues that are not seen in 
the structure. The deduced location of the hydrocarbon portion of the lipid bilayer is 
illustrated as a horizontal rectangle, and the head groups of the lipids are illustrated as 
light gray rectangles. Modified from Fu et al., 2000, Science 290, 481-486, with permission 
from the American Association for the Advancement of Science. 





returns to the intracellular surface, placing both N and C termini on the 
inside of the plasma membrane. 

Between the two genetic repeats the linking region forms two helices 
(109-120 and 126-134) as protrusions on the periplasmic side. These 
surround a concavity in the center of the tetramer where the shortest 
helices M2 and M6 meet near the fourfold axis of the tetramer, and on 
either side of the molecular quasi twofold axis. An extended chain (137- 
143) returns to the central channel and contributes three carbonyls to 
the entry vestibule. The two half-spanning helices M3 and M7, both in 
contact with the lipid-accessible exterior on the outside of the tetramer 
along one side, form an intimate contact between their N-terminal ends 
on the opposite side of the channel, related by the quasi twofold axis to 
form a coaxial bend. This unique, dimeric contact between N-terminal 
ends of two helices M3 and M7 plays a key role in maintaining a glycerol 
binding site in the center of the bilayer. The conserved “-NPA-” sequences 
(68-70 in M3, 203-205 in M7) create this interface, placing the proline 
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Fic. 3. The line of carbonyl acceptor groups that go through the GlpF channel is 
shown. The channel in GlpF is tripathic, with two of three sides being hydrophobic; one 
“side” is based on 8 carbonyl oxygens spaced at “3.0 A separation and supports a line 
of water, each with one OH acting as donor to the carbonyls through the channel. The 
central water position is acceptor from the conserved asparagine-68 and -203 NHo groups. 
This, along with the positively charged dipoles from half-spanning helices, polarizes the 
central water so that it donates hydrogen bonds to neighboring waters, thus polarizing 
them outward from the central position. This is the key to lack of proton conductance 
through the line of waters that are seen by crystallography to fill the channel with a line 


of polarized waters. 


H Arg 206 
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rings in van der Waals contact against each other, and cupping each pro- 
lyl side chain between the proline and alanine side chains of the opposite 
helix (Fig. 1). Each asparagine side chain is constrained by two hydro- 
gen bonds from main chain NHs to the side chain Oöls, which thereby 
orient the side-chain NHs toward acceptors on the permeant substrate. 


Ill. THE BASIS FOR SELECTIVITY THROUGH THE CHANNEL 


The channel pathway, defined by three glycerol molecules, has a wide 
vestibule ~15 A across on the outer surface and reaches its constriction 
of ~3.8 A x 3.4 A in size, 8 A above the central plane of the transmem- 
brane region. This constriction lies at the start of a ~28 A long selective 
tripathic channel (r < 3.5 A) that subsequently widens out again to- 
ward the cytoplasmic surface. We discuss the glycerol sites in the order 
(G1-G3) seen bya glycerol molecule moving along its trajectory from the 
periplasmic side to the cytoplasmic side, and the glycerol OH moieties 
likewise in numbered order from most external to more internal. 

Channel specificity is delineated by three glycerol molecules (Gl, 
G2, G3) with single water molecules between them and several water 
molecules that are bound in the entry vestibule and down to the selectiv- 
ity filter in the center of the bilayer. The Ol-H of Gl remains hydrated 
by water molecules. Gl in the entry vestibule is oriented by G1 O2-H 
as hydrogen bond donor to the O of Tyr-138 (2.7 A). G2 and G3 are 
both found in the selective region of the channel. G2 forms a hydrogen 
bond with one water molecule at its 1’-hydroxyl. Another bridging water 
molecule is hydrogen bonded to both G2 and G3, which suggests that 
glycerol and water are stoichiometrically cotransported. 

The selectivity filter at G2 is strongly amphipathic, with two aromatic 
rings (Trp-48, Phe-200) forming a two-sided corner, two N-H donors 
from the guanidinium side chain of Arg-206 on another side, and two 
main-chain carbonyl oxygens on the fourth side. G2 binds in the Trp- 
Phe-Arg triad leaving no free space around it, such that Van der Waals, 
hydrogen-bond, and electrostatic forces each play a role. The O1H and 
O2H of G2 are each both hydrogen bond acceptors from NHs of the 
guanidinium group of Arg-206 (2.9 A, 2.7 A) and hydrogen bond donors 
to the carbonyl oxygens of Gly-199 (2.6 A) and Phe-200 (2.8 A), respec- 
tively. The alkyl backbone of G2 is tightly packed against the aromatic cor- 
ner leaving no space for any substitutions at the C-H hydrogen positions. 

The two main-chain amide nitrogens of residues 200-201 are pinned 
in place by hydrogen bonds to the buried carboxyl of Glu-152, which 
is invariant in all AQP family members and provides a powerful 
restraining force. The carboxyl group of Glu-152 orients the three 
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adjacent carbonyls of Gly-199, Phe-200, Ala-201 in the entry vestibule 
and may increase negative charge on the carbonyl oxygens of Phe-200 
and Ala-201. Thus the binding of glycerol in the selective filter creates the 
potential for the negative charge of Glu-152 to form an electrostatic in- 
teraction with the positive charge on Arg-206, via the amides of Phe-200, 
Ala-201, and the glycerol OHs. This could accentuate the polarization 
of the glycerol OHs and imply that a permeant molecule should also be 
polarizable in cross section. 

The narrowest constriction (a putative “transition site” for transport) 
in the channel lies between G1 and G2, just above G2 O1H, and is barely 
large enough to allow passage of an HC-OH, the minimum cross section 
from the center of the glycerol molecule. At this narrowest position the 
hydrogen bond geometry from the donor NH2 of Arg-206 to an OH 
group becomes ideal, helping to specify the only possible orientation of 
the CHOH group in this putative transition site. 

G3 straddles the highly constrained invariant Asn-203 and Asn-68, the 
two “N”s at the conjunction of the two NPA sequences between M3 and 
M7. The NHg of Asn203 is donor to O1 of G3 (3.0 A) with ideal geometry. 
The NH: of Asn68 is donor to O2 of G3 (3.1 A), also with ideal geometry. 
OH3 of G3 is hydrogen bond donor to the carbonyl of highly conserved 
His-66 (2.8 A). Therefore glycerol molecules can only pass through the 
region of G2, G3 in single file, and with each CHOH within one molecule 
following its neighbor in single file. 

A second highly conserved buried carboxyl of Glu-14 is hydrogen 
bonded to the amide nitrogens of Leu-67 and His-66. Thus the carbonyls 
of His-66, Ala-65, and Gly-64 are strongly oriented in the cytoplasmic 
vestibule of the channel by interactions with the carboxyl of Glu-14. The 
oriented carbonyls are the quasi twofold equivalent of those at residues 
199-201 in the external vestibule. This creates a path of hydrogen bond 
acceptors that make a line down one wall of the channel. While glycerol 
is resolved at three sites through the translocation pathway, there may 
be other partially occupied intermediate sites that are interpreted here 
as water molecules. 

The glycerols in the selectivity filter, G2, G3 have the same handedness, 
consistent with their lying on a pathway that conducts glycerol without 
rotation as it is transported. This emphasizes the evolution of a chiral 
pathway for conduction of organic molecules that have a handedness to 
them. During conductance, glycerol becomes progressively dehydrated 
in the external vestibule as carbonyl oxygens replace its water of hydra- 
tion. Complete dehydration is not required since a water such as that 
seen between G2 and G3 can copermeate with each glycerol. Glycerol can 
move stepwise, in single file, through the selective filter by exchanging 
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one set of H-bonds for another along the closely spaced, highly oriented 
set of eight carbonyl oxygen acceptors and four N-H donor atoms. Alter- 
natively, glycerol diffusion may be visualized as a series of thermally ac- 
tivated transitions over the potential barriers that separate binding sites 
through the channel including those at G2, G3 (Lauger and Apell, 1982). 
In two positions, G2 O1H, and G2 O2H, the donors and acceptors 
from the channel are oriented with ideal geometry, requiring the dual 
role of donor and acceptor ~120° apart from each of two successive 
groups in the permeant molecule. The hydroxyl moiety almost uniquely 
matches the requirement for hydrogen bond donor and acceptor from 
the same atomic center. Binding the OH of a permeant molecule at 
G2 may favor the local potential energy minimum by bridging a long 
distance electrostatic interaction (8.0-8.5 A) between two charged side 
chains of Glu-152 and Arg-206 in the entrance to the selective filter. 


IV. ROLES OF CONSERVED RESIDUES: FUNCTIONAL AND STRUCTURAL 


Conservation across the AQP family identifies conserved structural 
and functional roles. The 25 most conserved residues are classified ac- 
cording to their locations (Table I). Six residues are conserved only in 
the first segment (group I). These residues are all close packed together 
in a structural core of the first segment that is very close to Asn-203, 
suggesting that this core is conserved in structure and orients Asn-203 at 
the G3 site. Three conserved residues found only in the second genetic 
repeat (group I) are found in disparate regions, with one, Arg-206 as 
an essential functional determinant, at the beginning of the selectivity 
filter. 

The gene-duplicated heritage represented by ~20% sequence iden- 
tity between the two segments, residues 6-108 and 144-254, is found 
throughout the AQP family. This is represented in the structure as a 
quasi twofold relationship between these segments (Cheng et al., 1997; 
Walz et al., 1997; Fu et al., 2000). Correspondingly several of the highly 
conserved residues between segments land 2 (group III) have structural 
roles, and others have more functional roles. Of the structural class, the 
most conserved residues all lie near the center of the bilayer. Of eight 
pairs of residues conserved between both genetic repeats (group IN), 
six pairs are located close to the plane through the center of the trans- 
membrane region and related by the structural quasi twofold axis. They 
are found in pairs of side chains that contact each other, and these con- 
tacting pairs are twofold related to other similarly conserved pairs. This 
arrangement suggests a possible collective structural role that may be 
related to packing around the quasi twofold symmetry axis in GlpF. 
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TABLE I 
Some Conserved Residues and Sequence-Structure Relationship in GF ab 





Residues Location in structure 
Group I 
S63 Core packing near N68 
H66 Core packing near N68 
V71 Core packing near N68 
T72 Core packing near N68 
Y89 Core packing near N68 
Q93 Core packing near N68 
Group II 
L159 In the channel, near G3 
R206 Key selectivity filter residue G2 
P240 Packing core near N203 
Group III 
E14/E152 Orient amides in the vestibule 
T18/T156 Packing in mid-membrane plane 
G49/G184 Packing in mid-membrane plane 
G96/G243 Packing in mid-membrane plane 
G64/G199 Orients amides in the vestibules 
NPA (68-70) /NPA (203-205) Mid-membrane plane, and G3 


“ From Fu et al. 2000, Science 290, 481-486, with permission from the 
American Association for the Advancement of Science. 

H The roles of conserved residues are indicated. First the rank order of 
the sequence conservation was evaluated according to the informational 
entropy at every position in the sequence alignment of 46 representa- 
tive AQP family members. The first group I includes residues only found 
in the first genetic repeat. The second group II includes residues con- 
served in the second repeat only, and group IH includes residues that 
are present at equivalent positions and conserved in both repeats I and I. 
Groups I and II therefore may be related to structure and function specific 
to each respective repeat, while group III reflects the symmetric relation- 
ship between the two genetic repeats. 


Four of the group III residues are conserved glycines located within 
helices. Far from any more usual role in disrupting helices of soluble 
proteins, these transmembrane helix glycines allow the closest approach 
in the helical bundle (Javadpour et al., 1999) that occurs in the central 
plane. These glycines are invariant or highly conserved to fulfil their 
role in the close contacts at the helix crossing contacts (MacKenzie 
et al., 1997). 

The helices diverge outward from this central plane of closest contact 
to generate the vestibules, one on on each surface. The interhelix angles 
of ~+35° to +40° are not in the categories most favored by knobs into 
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holes packing (Walther et al., 1996). However, helix packing within the 
bundle is under powerful constraints from the hydrophobic bilayer and 
from the interfaces between monomers in the tetramer where helices 
pack with more optimal knobs into hole packing angles of ~—20°. 

The two conserved, buried glutamates Glu-14, Glu-152 have clear func- 
tional roles, each orienting four carbonyls of successive amino acids in 
the vestibules. The conserved pair of group III Gly-64 and Gly-199 are 
symmetrically located at the extremities ofthe central selective filter and 
serve to structure the downturn of the extended chains from 65 to 68 
and 200 to 203 that form the critical line of four carbonyls each that in 
turn provide the pitons for successive hydrogen bond donors on their 
way through the channel (Fig. 3). 

The conservation of the -NPA- sequences at 68-70 and 203-205 can 
be rationalized in both structural and functional terms. Structurally it 
preserves a key signature contact between M3 and M7. There are plausi- 
ble structural reasons why residues that are in contact around the quasi 
twofold axis such as the pair of NPA motifs and the Gly-49, Gly-184 pair 
would remain conserved, since each effectively resolves two packing situ- 
ations at once. All these residues related by the quasi twofold within GlpF 
are among the most conserved residues throughout the entire AQP pro- 
tein family. This therefore suggests that the GlpF structure is prophetic 
of all other AQP relatives. 


V. STEREOSELECTIVE PREFERENCES OF GlpF AMONG LINEAR ALDITOLS 


Osmotic methods have been used to measure influx of carbohydrates 
into proteoliposomes, reconstituted using purified GlpF. The rate of in- 
flux of the tested molecules is indicated by recovery of the initial volume 
of the proteoliposome. Ribitol, with all OH groups having the same 
stereospecific relationship to the carbon backbone, generated a clear 
exponential reswelling with a time constant of ~0.35 s™!. In contrast 
D-arabitol, a chiral stereoisomer of ribitol with a mixed arrangement of 
hydroxyls, shows a ~10-fold reduction in transport relative to ribitol, 
demonstrating stereoselectivity of the channel. 

Membranes have a high intrinsic permeability for glycerol. When GlpF 
is present the reswelling rate increases such that the vesicles recover by 
a factor similar to that seen with ribitol. To ensure that this rate is due to 
GlpF, this rate was measured as a function of the amount of GIpF used in 
the reconstitution. The linear dependence demonstrates that the trans- 
port is directly proportional to the number of GlpF molecules per vesicle. 

The rates demonstrate a stereoselective channel, consistent with the 
exact positioning of donors, acceptors, and hydrophobic contacts. The 
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structure of the central selectivity filter shows that it can accommodate 
either prochiral carbon orientation, placing the hydrogen attached to 
the carbon in contact with the aromatic side chain either of Phe-200 or 
of Trp-48, depending on its chirality. Therefore the conductivity is in 
principle different depending on the enatiomer in the selectivity filter, 
or which way glycerol enters the channel. 

Stereoselectivity is seen between different chain lengths and different 
chiralities. As an example there is a factor of 10 difference in rate be- 
tween ribitol and its stereoisomer D-arabitol. The structure shows how 
this can be visualized since two successive -CHOH groups are oriented 
at the G2 site. The carbon backbone must be lined up along the channel 
axis. Depending on enantiomer, the hydrogens attached to the carbon 
are in contact with the aromatic ring either of Phe-200 or of Trp-48, that 
forms two sides of the four-sided channel in similar environments. How- 
ever, any CHOH groups adjacent to these two will alternately place the 
carbon in one of two tetrahedrally disposed sites that have quite differ- 
ent environments. Of those tested, chiral molecules had lower rates than 
nonchiral molecules. There is no conductivity of cyclized pentoses and 
hexoses that can be rationalized as due to their size. The reswelling rates 
observed follow the same trend as was observed for GlpF upon induction 
in whole cells (Heller et al., 1980), showing that in vivo characteristics 
are represented in our assay using recombinant protein reconstituted 
into vesicles. 


VI. SIMULATIONS AND RATES OF GLYCEROL PASSING 
THROUGH THE CHANNEL 


Molecular mechanics provides a method of assessing various contribu- 
tors to the rates of events. Though such methods are often criticized for 
leading structures to diverge from the experimentally observed, they re- 
main the constant hope for amore rational and quantitative assessment 
of mechanisms. In all, 20 hydrogen bond acceptors and 23 hydrogen 
bond donors participate in the conductance of glycerol (Nollert et al., 
2001; Jensen et al., 2002). The methods also can be held to account for 
the experimental values of measured properties, and if necessary recti- 
fied so that differences between one channel structure and another of 
different property can at least be suggested and used to redirect the next 
wave of experiments. In the context of GlpF, the first structure of an AQP 
at adequate resolution, such calculations seem to be quite robust such 
that after a 1 ns equilibration time the model remains within 1 A rms of 
the crystal structure, leaving the channel structure essentially identical 
to the bounds of motion and structure obtained by X-ray crystallography. 
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The passage of glycerol evokes little if any change in the structure of 
the channel, as also seen by comparing the crystallographic structure in 
glycerol, versus that in water alone (Tajkhorshid et al., 2002). Therefore 
there is theoretical and experimental support for the conclusion that 
there is no structural gating of the channel; it is a highly stable, open, 
selective channel. 

For molecular mechanics, the GlpF tetramer is typically embedded 
in a model of 16:0/18:1c9-palmitoyl-oleyl-phosphatidylethanolamine 
(POPE) lipid bilayer fully hydrated by water, and subjected to molec- 
ular mechanics at 310K and atmospheric pressure. 

To determine the rate of conductance of a permeant molecule, it 
is necessary to drive the molecule through the channel since it would 
take far too long for a single molecule to find its way from one side 
to the other without any directional assistance. In physiological terms 
the driving force is presented as a concentration gradient, such that the 
free energy seen by the permeant molecule has a net downhill slope 
to it. In molecular mechanics terms, a theoretical external force f(t) 
is applied to move a glycerol molecule through the channel at con- 
stant velocity (Jensen et al., 2002). The necessary force is related to 
the gradient of an energy profile through the channel at each partic- 
ular position. The gradients are connected by a smooth mathemati- 
cal function based on sums of sin wave functions to reveal an energy 
landscape through the channel. The potential of mean force provides 
a means of effectively mapping the energy landscape seen by a glyc- 
erol molecule in passage through the conducting pathway. Jarzynski’s 
identity between free energy and irreversible work allows for reconstruc- 
tion of the potential of mean force along the conduction pathway by 
sampling a series of time snapshots through the molecular dynamics 
trajectory. The experimentally observed binding sites were thus justi- 
fied in molecular and thermodynamic terms. And barriers inside the 
channel could be associated with transition-state free energies, and 
hence rates of conduction. The energy profile also reveals a low-energy 
periplasmic vestibule suited for efficient uptake of glycerol from the 
environment. 


VII. SIMULATION AND RATES OF WATER PASSAGE THROUGH 
THE GlpF (AN AQP) CHANNEL 


To understand the conduction of water through AQP channels such 
as the GlpF channel, the crystal structure of GlpF with only water in the 
channel at 2.7 A that reveals the mean positions and probabilities of po- 
sition can be compared with a molecular simulation in which 7-9 water 
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molecules constitute a single file in the 20 Ä constriction region of 
the channel. The calculated diffusion constant corresponds to a flux 
of 2.4 x 10° el. which is 5 times larger than the deduced flux for GlpF 
(Borgnia et al., 1999a,b; Borgnia and Agre, 2001) of 0.5 x 10° s-!. The 
observed difference may be due to rate-limiting steps, such as the en- 
trance of water into the channel, which cannot be included in the calcu- 
lation ofa one-dimensional diffusion constant. The agreement between 
simulation and the crystal structure is in general very good, indicat- 
ing that the water molecules form a stable line of 7-9 water molecules 
through the entire channel, and corroborating that the force fields and 
assumptions necessary in molecular mechanics are a good representa- 
tion of the situation in this type of channel. 

Molecular mechanics is a rapidly evolving field that is now finding 
remarkable successes in the membrane channel field which may be 
well suited to the conditions applied. Thus it is most useful not only 
when molecular mechanics can rationalize the experimentally deter- 
mined rates of conductance and selectivities, but when it also serves to 
allow predictive simulations that test hypotheses prior to experiment. In 
these cases the technology has reached a most useful prospective phase. 

The structure and the simulation indicate that the water molecules 
proceed together through the channel driven by osmotic pressure. The 
water molecules are also polarized by the carbonyls such that they each 
are donors to the eight carbonyls that line the channel, but such that 
the central water molecule is oriented by the central asparagine side- 
chain NH donors to polarize the entire line of waters that face outward 
from that central position, as was postulated by Murata et al. (2000) from 
their 3.5-6 A electron microscopic structure of AQP1. The helix dipoles 
of helices M3 and M7 also place their positively charged ends together 
near the NPA regions; thus they, too, have been suggested to play a role 
in orienting water molecules at the center, so “bipolarizing” the line of 
water. In addition it is suggested based on molecular simulations that the 
full charge present on Arg-206 may play a role in polarizing the central 
water molecule (de Groot and Grubmuller, 2001). 

The relative contributions of these three contributors to polarizing 
the central water could be assessed for the degree of their contribution 
by “turning off” each one of the charges associated with the polarizing 
dipoles in turn (Tajkhorshid et al., 2002). When the partial charges on the 
NHs-groups of the side chains of Asn-68 and Asn-203 are turned off, 
the effect on water configuration is partial. As the partial charges on 
the amide backbone atoms of the half-helices M3 and M7 are turned 
off, which effectively eliminates the helix dipoles, a partial effect is also 
observed. And as this is combined with the NHe groups of the side chains 
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of Asn-68 and Asn-203, the bipolar orientation of the single-file arrange- 
ment breaks down, potentially leading to a proton wire. Although the 
dipoles of the channel-lining carbonyl groups alone cannot maintain the 
bipolar orientation of water molecules, their absence significantly accel- 
erates the relaxation toward orientational disorder. Therefore each of 
these factors contributes to the abrogation of any proton conductance 
through the channel. 


VIII. INSULATION AGAINST PROTON CONDUCTION IN AQPS 


The conduction of protons through water can be remarkably fast, pro- 
ceeding by the Grotthus mechanism (Grotthuss, 1806) in which protons 
hop, perhaps cooperatively, throughout a line of hydrogen bonded water 
molecules, as is observed in the proton tunneling through ice. If the pro- 
tons move cooperatively, no charge need ever be seen in the center of 
the channel, where the fully charged Arg-206 might become a major re- 
pulsive element, nor any bulk transfer of water take place, yet the proton 
could easily “leak” through the line of water. The molecular mechanics 
dissection of the contributors shows that the polarized central water, as 
implied by the crystallography (Fu et al., 2000; Nollert et al., 2001; Sui 
et al., 2001; Fu et al., 2002), is key to this insulation (Tajkhorshid et al., 
2002; de Groot and Grubmuller, 2001). 


IX. QUATERNARY STRUCTURE OF GlpF (AND AQPs) 


Each tetramer comprises four closely associated monomeric channels 
circled by a hydrophobic surface long enough to span the lipid bilayer 
(Fig. 4). Toward the cytoplasmic and periplasmic surfaces are layers that 
include side chains of tyrosine and tryptophan that can productively 
interact with the polar-nonpolar interface in the lipid head-group re- 
gion as in other integral membrane proteins (Koeppe and Anderson, 
1996). These layers are flanked by two outer layers of charged residues, 
35 A apart, that result in net positive charge on the cytoplasmic side. 

Each tetramer has a central concave surface on the periplasmic side 
where the two central transmembrane helices M2 and M6 of each 
monomeric channel in the tetramer are not long enough to span the 
bilayer, consistent with the tetramer being the physiological quaternary 
structure. The interfaces between subunits are almost as hydrophobic as 
the exterior, suggesting that a monomer could plausibly lie in the mem- 
brane, though under stress due to mismatch of thickness that might lead 
to changes in the orientation of the short transmembrane helices, and 
therefore in conductance properties of such a postulated monomer. 
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Fic. 4. A surface representation of the tetramer shaded according to electrostatic po- 
tential: Light gray represents negative, dark gray positive charge. Top left is a view from 
the periplasmic side that is predominantly negatively charged. The center is a fourfold 
symmetry axis that relates all four channels in the image. The centers of the channels 
can be seen to traverse the entire molecule, leaving a white zone seen clearly throughout 
the entire channel. At the top right the view is from the cytoplasmic side and is predom- 
inantly positively charged. Bottom left is a side view of a space-filling representation of 
the structure and shows ordered detergent molecules of octyl glucoside as they associate 
with the periplasmic leaflet interfacial zone. Bottom right is an electrostatic surface view 
of the same, showing the preponderance of negative charge on the periplasmic side of 
the membrane, and the positive charge on the cytoplasmic surface. 


The tetramer seen in the GlpF crystal structure has close similar- 
ity to the tetramers that associate, alternate sides up, in reconstituted 
membranes to form two-dimensional crystals, as characterized by image 
reconstructions of AQP1 (Cheng et al., 1997; Li et al., 1997; Walz et al., 
1997; Hasler et al., 1998a,b; Mitsuoka et al., 1999; Ringler et al., 1999). 
However, the GlpF tetramers are each surrounded by detergent, and the 
packing between neighboring tetramers is therefore different. For ex- 
ample, GlpF could reasonably be inserted into membranes as monomers 
during synthesis, and subsequently gain stability by tetramerization. 

Twelve ordered octyl glucoside molecules are associated with the aro- 
matic regions near the external side of each tetramer (Fig. 4). The 
pyranoside rings interact laterally with the external tryptophan/ tyrosine 
layer, while the alkyl chains extend toward the hydrophobic core. 
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X. THE ION CHANNEL IN AQP6; A POSSIBLE PORE ON 
THE FOURFOLD AXIS OF AQPs? 


Mostion channels are composed of multimers of proteins. Sometimes 
they are homomeric as with the MscL stretch-activated channel potas- 
sium channel (Chang et al., 1998); sometimes they are composed of 
similar subunits as in the acetylcholine receptor (Stroud et al., 1990). 
Therefore the tetramer in AQPs begs the question as to what proper- 
ties pertain to the central fourfold axis. There are three density peaks 
on the fourfold axis near the external surface. One of these peaks, that 
refines to 12(+3) electrons, is coordinated at 2.9 A by four otherwise 
buried glutamates (Glu-43) in a square planar arrangement. Below this 
peak is a second, waterlike peak that is 2.2 A from the first peak, im- 
plying that the first peak is an octahedrally coordinated cation, most 
likely Ma?" (present at ~300 mM), that is bound in the fourfold axial 
pathway. Physiologically Mg?* (~1 mM) or Nat (~145 mM) could ful- 
fill this role. The remainder of the fourfold transmembrane pathway 
is surrounded by exclusively nonpolar aliphatic side chains making it 
extremely hydrophobic, and seemingly closed to all hydrated ions and 
polar molecules larger than water. While this is the case in GlpF, there 
is a central cavity ~10.6 A in diameter that resembles the structure in 
the KcsA potassium channel. And in AQP6, the constriction on the cy- 
toplasmic side is significantly wider, suggesting that it then could be an 
ion-conducting channel. 


XI. GlpF CHANNEL SELECTIVITY FOR ANTIMONITE 


Escherichia coli resistant to antimonite have a disruption in the glpf 
gene suggesting that GlpF conducts antimonite, the neutral pH form 
of which is Sb(OH)s; therefore it is expected that the channel again 
provides adequately positioned hydrogen bond accepting carbonyls to 
support the polar side of the Sb(OH)s molecule. 


XII. SELECTIVITY AGAINST PASSING IONS OR AN 
ELECTROCHEMICAL GRADIENT 


A fascinating question pertains to the necessity for all plasma mem- 
brane channels to remain insulating to any electrical leakage of ions. 
How is this accomplished? Common to GlpF and AQPs is their abso- 
lute selectivity against ions and charged solutes. Therefore, comparison 
with known ion channels helps to define what is required to allow ion 
conductance in a channel. While the channel is large enough to hold 
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an ion such as Nat, Kt, Cat, or CIS, it is too small to accommodate a 
hydrated ion, and by its amphipathic nature provides no replacement 
for water of hydration on one entire side. The energetic cost of remov- 
ing even a single water of hydration from an ion can be prohibitive. 
Therefore cation conducting channels provide oriented carbonyl groups 
that carry a ö-charge. These can replace any water (s) of hydration dis- 
placed within any narrow selective region that intrudes on the first hy- 
dration shell (Doyle and Wallace, 1997; Chang et al., 1998; Doyle et al., 
1998; Koeppe et al., 1999; Roux and MacKinnon, 1999). 

By requiring a group that is polarized with both ô+ donor and 6— ac- 
ceptor character, both anions and cations are excluded. The two charged 
side chains of Arg-206 and Glu-152 might impose an energy barrier to 
passage of a charged ion or solute molecule including OH- or H3O?. 
Since Arg-206 and Glu-152 are conserved across almost all AQPs and 
GlpF, electrostatic repulsion would play the same role in AQPs. That no 
fully charged counteranion is bound to Arg-206 where glycerol fills the 
site further emphasizes that removal of water of hydration from an anion 
without compensation is enough to prevent any passage of ions through 
this selectivity filter. 

Removal of water and the fixing of a specific conformation of glycerol 
in the GlpF channel are also energetically costly. However, the structure 
specifically matches the donor plus acceptor and amphipathic nature of 
each of a series of C-OH groups, replacing the hydration shell in water 
by a precise match within the channel. 


XIII. THE VARIOUS CONTRIBUTIONS TO REJECTION 
OF PROTON CONDUCTANCE 


A most interesting conundrum is how AQPs completely exclude con- 
ductance of protons that could be lethal to any cell. This is especially 
true in these channels that can support a line of water molecule posi- 
tions through the channel region. The Grotthus mechanism involves the 
protonation of one water from a donor at one side of the mebrane, and 
the concerted passing of another proton from that water molecule to 
its neighbor through the channel. In principle, if concerted throughout 
the line of water molecules, this mechanism could leak protons from 
one side of the membrane to the other, without generating any charge 
within the 28 A narrow region of the channel. 

To analyze the basis for water conductance of GlpF we determined the 
crystal structure of GlpF in the absence of glycerol (Tajkhorshid et al., 
2002). This structure at 2.7 A resolution showed that water molecules 
form a line through the channel in which each water molecule is a hy- 
drogen bond donor to one of the eight carbonyls that line the channel, 
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and also donor to a neighboring water. The only exception is for the 
central water molecule that is hydrogen bond acceptor from both con- 
served asparagines of the -NPA- box. Lying opposite the hydrophobic 
corner, its hydrogens point away from the center and become hydrogen 
bond donors to the neighboring waters on both sides, so bipolarizing 
the entire line of waters, making each water an acceptor from water on 
its inside relative to the center of the bilayer at W 522, opposite the 
conserved asparagines at the channel center. 

There are two places at which there is no electron density for water 
molecules that we would expect to be good sites for water. One is tri- 
angulated opposite the terminal NHg group of conserved Arg-206 and 
the carbonyl of Gly-199 (z ~ —3 A relative to z = 0 A at the carbonyl 
of Ala-201, becoming positive toward the cytoplasmic side), leaving a 
distance of almost two hydrogen bonds, 5.5 A, between the waters adja- 
cent to this site. The second case where density is absent is opposite the 
carbonyl of the gene-duplicated Gly-64 at the other end of the channel, 
leaving a similar gap of 4.5 A. The lack of density at these sites may be for 
several reasons: (1) There may be a break in the line of waters here. If so 
this would already provide some basis for the insulation toward passage 
of protons. However, this seems unlikely in light of the ideal sites, even 
though one of them lies at the narrowest region of the channel. (2) This 
reasoning put slightly differently could be restated that the probability of 
finding waters at these two sites is somewhat lower perhaps because they 
are somewhat higher in energy. Molecular mechanics tends to support 
this conclusion for the first of these waters that lies at the narrowest part 
of the channel. (3) There may be a somewhat disordered water molecule 
at each site, thus blurring and flattening the density. Even though the 
crystallography is carried out at —100°C where water is immobilized it 
still may be frozen differently in each molecule, thereby still being absent 
from the density. 


XIV. SELECTIVITY FOR GLYCEROL VERSUS WATER 


GlpF also conducts water, but at a relative rate of 23 versus 154 per 
second for the water channel AQPZ (Borgnia and Agre, 2001). The con- 
ductivity of water preferring AQPs AQPZ and AQP-1 for glycerol, urea, 
or other uncharged small molecules is essentially undetectable. Con- 
ductivity for water in the absence of glycerol may be severely impaired 
by the amphipathic nature of the channel, since the hydration shell of 
water itself is quite stringent, with many more neighbors than found in 
carbohydrates. Several waters at one time, required to pass the filter in 
single file by the structure, could not retain more than two neighbors 
in the center of this channel. The dehydration of water would therefore 
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make its transport energetically more costly than transport of the water 
molecules seen between OH groups on successive glycerols or alditols. 

AQPs have a high permeability to water and not glycerol, whereas a 
subclass is permeant to both water and glycerol. The geometric require- 
ments for hydration of water are stringent, with many four- and five- 
coordinate waters (Bernal and Fowler, 1933; Eisenberg and Kauzman, 
1969). All residues throughout the selective filter are highly conserved 
between GlpF and AQPs except for Trp-48 (Phe or His in AQPs), Phe-200 
(typically Ala, Thr, or Cys in AQPs), and Ala-201, all in the narrowest re- 
gion of the filter. Therefore the larger size in the AQP filter will support 
an increased number of water molecules, so alleviating the barrier to 
water that would be imposed in GlpF by desolvation of several waters 
in single file, and allowing a greater degree of water coordination. At 
the same time removal of the indole of Trp-48 and the phenyl ring of 
Phe-200 will remove the hydrophobic corner that can no longer provide 
Van der Waals interaction with the alkyl chain of glycerol in AQPs. 

In AQP1, C191 is responsible for mercury inhibition of transport 
(Prasad et al., 1998). It is positioned similarly to its homologous residue 
Phe-200 in GlpF at the narrowest position of the channel. Thus mercury 
inhibition in aquaporins is brought about by a chemical modification of 
the thiol in the selectivity filter. 

Mutagenesis of two residues, Tyr236Pro and Trp237Leu, in AQP1 al- 
lowed it to pass glycerol (Lagree et al., 1999). In GlpF Pro-236 lies in the 
protein interior in contact with Ala-201 side chain behind G2. Leu-237 
contacts Pro-204 of the second NPA sequence at the G3 site. Therefore 
mutational changes at these sites can be relayed to the filter to favor 
passage of glycerol. 

Substitution of the conserved Arg-206 in a water channel, AQPZ 
(Arg189Val/Ser), inactivated water transport (Borgnia et al., 1999b). A 
mutation of the conserved Arg-206 in the related aquaporin-2 (AQP2) 
(Arg-187) is known to cause nephrogenic diabetes insipidus in humans. 
It is believed that the disease is caused by an impaired intracellular mem- 
brane traffic as indicated by experiments demonstrating that human 
AQP2 Arg187Cys expressed in Xenopus oocytes is not transferred to the 
cell membrane (Deen et al., 1995). However, the location of Arg-206 in 
the selectivity filter must alter the water transport of AQP2 in collecting 
duct principal cells in the kidney. 


XV. REGULATED ION CHANNELS FORMED BY MEMBERS 
OF THE AQP FAMILY 


Members of the AQP family are also thought to act as regulated ion 
channels in addition to their primary channel selectivity (Yool et al., 
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1996; Agre et al., 1997). However, anion conductance of AQP6 at low pH 
(Yasui et al., 1999) and cGMP gated potassium conductance of AQP1 
(Anthony et al., 2000) have been demonstrated. In this regard the trans- 
membrane pore formed between molecules around the fourfold axis 
deserves attention. This pore has a bound cation at the external surface, 
and the variations in pore diameter (Smart et al., 1997) (%min = 1.17 A) 
are closely similar to those of the (also tetrameric) KcsA potassium 
channel throughout its length (fmin = 0.53 A) (Andersen and Koeppe, 
1992). In fact the diameters of the GlpF tetramer are greater except 
at one position near the cytoplasmic surface, and the narrow portion 
of the fourfold axis is shorter than in KcsA. Like the KcsA channel it 
also has a r ~ 4 A cavity in the center of the bilayer, and it is ~10 A 
shorter than the KcsA channel, making this a shorter track across the 
bilayer. The main impediment to regarding this as an ion channel is 
only that the walls are hydrophobic in GlpF (which is not an ion chan- 
nel). The fourfold axis therefore deserves consideration as a candidate 
for becoming an ion channel in other AQPs where the structure re- 
sponds to phosphorylation, ligand binding, pH, or interaction with other 
proteins. 
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M intermediate, bacteriorhodopsin, 
121-125 
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Mitochondria 
bovine heart, cytochrome Ae complex 
cytochrome b binding sites, 158-162 
electron bifurcation, 162-165 
ISP extrinsic domain movement, 
162-165 
overall structure, 156-158 
overview, 153 
proton motive Qcycle, 154-156 
quinone binding sites, 158-162 
redox-active subunits, 154-156 
membrane protein assembly, 10-11 
M > N reactions, bacteriorhodopsin 
protonation pathways, 125-126 
Models 
ion channel voltage sensor, 227-228 
transmembrane helix thermodynamic 
studies, 24-25 
ubiquinol binding site, 173-174 
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N intermediate, bacteriorhodopsin, 123-125 
Nitrogen, transmembrane helix hydrogen 
bonding, 37-39 
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NMR, see Nuclear magnetic resonance 

Nontraditional hydrogen bonding, 
transmembrane helices, 39-41 

N > O reactions, bacteriorhodopsin 
protonation pathways, 125-126 

Nuclear magnetic resonance, f-barrel 
protein structures, 49-50 
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O intermediate, bacteriorhodopsin, 
123-125 
OmpA 
B-barrel folding and stability, 62-63 
B-barrel structure, 53 
OmpGC, f-barrel function, 60 
OmpF 
B-barrel folding and stability, 63 
B-barrel function, 60 
OmpLA 
B-barrel folding and stability, 63 
B-barrel function, 59 
P-barrel structure, 54-55 
OmpT 
B-barrel function, 59 
P-barrel structure, 54 
OmpX 
B-barrel folding and stability, 63 
B-barrel function, 59 
B-barrel structure, 53 
Osmotic pressure, mechanosensitive ion 
channels, 182-183 
Oxidation reactions, Photosystem II, 
protonation, 100-102 
Oxygen, transmembrane helix hydrogen 
bonding, 37-39 
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PDB, see Protein Data Bank 
Peroxisomes, membrane protein 
assembly, 12 

Photosynthesis 
diffusion management, 103-105 
protein relaxations, 95-96 
protonation archetype, 96-100 
protonation in Photosystem II oxidation 
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Photosynthesis (Continueed ) 
proton transfer and coupled PT/ET, 95 
proton transfer design issues, 93-95 
proton transfer design robustness, 
102-103 
Photosystem II, oxidation reactions, 
protonation, 100-102 
Photosystems 
length scales overview, 72-75 
natural redox protein design, length 
management, 75-78 
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photosynthesis 
diffusion management, 103-105 
protein relaxations, 95-96 
protonation archetype, 96-100 
protonation in Photosystem II oxidation 
reactions, 100-102 
proton transfer and coupled PT/ET, 95 
proton transfer design issues, 93-95 
proton transfer design robustness, 
102-103 
simulation with tunneling theory, 90-92 
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Prokaryotic cells 
mechanosensitive ion channels 
archael MscS-like proteins, 203-204 
gating transitions, 183-185 
helix packing, 183-185 
mechanosensitivity aspects, 204-205 
MscL 
amino terminus, 189 
computational studies, 199-200 
cytoplasmic domain, 192 
extracellular loop, 191 
gating mechanism, 192-194 
open state, 194-199 
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113-114 
Protein relaxations, photosynthesis, 
95-96 
Protonation 
bacteriorhodopsin, 125-126 
Photosystem II oxidation reactions, 
100-102 
purple bacteria QA/Qß, proton coupling 
first electron transfer, 96-98 
second electron transfer, 98-100 
Proton conductance 
aquaporin rejection, 310-311 
insulation in aquaporins, 307 
Proton coupling, purple bacteria Qa/Qp 
first electron transfer, 96-98 
second electron transfer, 98-100 
Proton motive Qcycle, cytochrome bc 
complex, 154-156 
Proton transfer 
design issues, 93-95 
design robustness, 102-103 
photosynthesis, 95 
Wolinella succinogenes QFR, 142-146 
Purple bacteria, Qa/Qpg, proton coupling 
first electron transfer, 96-98 
second electron transfer, 98-100 
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Qp binding pocket, electron bifurcation 
mechanism, 165 
OFR, see Quinol:fumarate reductases 
Quinol:fumarate reductases 
coupled electron/proton transfer, 
145-146 
crystal forms, 134 
electron and proton transfer, 142-145 
function, 132 
menaquinone oxidation, 141-142 
menaquinone reduction, 141-142 
soluble and membrane-embedded 
subunits, 141 
subunit A, 134-136 
subunit B, 136-137 
subunit C, 137-139 
Quinones 
cytochrome bc; complex, 158-162 
purple bacteria, proton coupling 
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first electron transfer, 96-98 
second electron transfer, 98-100 
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RC proteins, see Reaction center proteins 
Reaction center proteins 
chlorin symmetry, 87 
design, 76-77 
electron transfer rate prediction, 82-84 
length scale overview, 72-75 
overview, 71-72 
size and length management, 79-82 
Redox chains 
charge recombination, 88-90 
heme chain archetype, 85-86 
primary chlorin chains, 86-88 
Redox proteins, natural design, length 
management, 75-78 
Rhodopsin 
crystal structure, 249-253 
disk membrane structure, 274-277 
dodecyl maltoside solution 
C1 dynamics, 272-274 
C1 structure, 261-263 
C2 dynamics, 272-274 
C2 structure, 264-265 
C3 dynamics, 272-274 
C3 structure, 265-267 
C-terminal tail dynamics, 272-274 
C-terminal tail structure, 258-260 
disulfide cross-linking, 257-258 
H8 dynamics, 272-274 
H8 structure, 269-272 
site-directed spin labeling, 253-257 
sulfhydryl reactivity, 257-258 
interhelical cross-linking experiments, 
281-283 
mechanism of activation, 285-286 
membrane-aqueous interface location, 
274-277 
overview, 243-249 
R1 mobility, 277-281 
structure-function relationships, 
283-285 
Ribosome-Sec61 translocon complex, 
membrane protein assembly, 3-4 
Rps. viridis, heme chain archetype, 
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S3-S4 linker, ion channel residue 
movement, 226-227 
Salt bridges, transmembrane helices, 
41-42 
SDSL, see Site-directed spin labeling 
Sec61 translocon, membrane protein 
assembly, 2 
Second electron transfer, proton coupling, 
purple bacteria Qa/Qpg, 98-100 
SecYEG, membrane protein assembly, 2, 
9-10 
SecYEG-YidC translocon, membrane 
protein assembly, 9-10 
Signal recognition particle, 3 
Simulations 
gramicidin A Brownian dynamics, 94 
photosystems with tunneling theory, 
90-92 
Site-directed spin labeling 
KcsA, 229-231 
rhodopsin, 274-275 
rhodopsin in dodecyl maltoside solution, 
253-257 
rhodopsin structure-function 
relationships, 283-285 
Skp, -barrel folding and stability, 62 
SQR, see Succinate:quinone reductase 
SRP, see Signal recognition particle 
Streptomyces lividans ET channel 
helical movement during gating, 
231-234 
ionic permeation and gating model, 
228-229 
selectivity filter in mechanism, 
234-238 
site-directed spin label analysis, 
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Structure-function relationships 
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1-2 
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basic reaction, 131 
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structure, 134 
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Succinate:quinone reductase, 132 
Sulfhydryl reactivity, rhodopsin, 257-258 
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Tat system, see Twin-arginine translocation 
system 
Thermodynamic studies 
mechanosensitive ion channels, 180-181 
transmembrane helices 
experiments, 25-28 
models, 24-25 
TIM, f-barrel structure, 56 
TIM22 pathway, membrane protein 
assembly, 11 
TM helices, see Transmembrane helices 
TolC, -barrel structure, 54-55 
TOM-TIM23 pathway, membrane protein 
assembly, 11 
Transmembrane helices 
hydrogen bonding 
nitrogen and oxygen bonds, 37-39 
nontraditional hydrogen bonds, 39-41 
salt bridges, 41-42 
lipid interactions 
curvature, 33-35 
hydrophobic matching, 30-32 
lateral capillary forces, 32-33 
overview, 29-30 
specific interactions, 35-36 
loop construction comparison, 28-29 
MscL, 189-192 
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packing, 21-23 
residue environment preferences, 23 
rhodopsin activation mechanism, 285-286 
rhodopsin C1 in solution, 261-263 
rhodopsin C2 in solution, 264-265 
rhodopsin C3 in solution, 265-267 
rhodopsin H8 in solution, 269-272 
rhodopsin interhelical cross-linking 
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rhodopsin R1 mobility, 277-281 
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van der Waals interactions, 36-37 
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Transmembrane proteins, lipid interactions 
curvature, 33-35 
hydrophobic matching, 30-32 
lateral capillary forces, 32-33 
overview, 29-30 
specific interactions, 35-36 
Tunneling theory, photosystem simulation, 
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Twin-arginine translocation system, 
membrane protein assembly, 2 
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Ubiquinol, binding site modeling, 173-174 
Ubiquinone binding sites 

cytochrome bos, 170-171 

membrane proteins, 169-170 
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Van der Waals interactions, transmembrane 
helices, 36-37 
Visual signal transduction, 243-249 
Voltage sensor, ion channels 
charged residue movement overview, 
217-218 
charge to open channel, 213-215 
electric field, 221-222 
gating charge residues, 215-217 
model, 227-228 
overview, 211-213 
residue distance changes, 222-226 
residue exposure probing, 218-221 
S3-S4 linker role, 226-227 


W 


Water 
GlpF glycerol selectivity comparison, 
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passage through GlpF channel, 305-307 
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coupled electron/proton transfer, 
145-146 
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